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Preface 



In the fall of 1987, I taught a graduate computer science course entitled 
"Symbolic Computational Algebra" at New York University. A rough set 
of class-notes grew out of this class and evolved into the following final 
form at an excruciatingly slow pace over the last five years. This book also 
benefited from the comments and experience of several people, some of 
whom used the notes in various computer science and mathematics courses 
at Carnegie-Mellon, Cornell, Princeton and UC Berkeley. 

The book is meant for graduate students with a training in theoretical 
computer science, who would like to either do research in computational 
algebra or understand the algorithmic underpinnings of various commer- 
cial symbolic computational systems: Mathematica, Maple or Axiom, for 
instance. Also, it is hoped that other researchers in the robotics, solid 
modeling, computational geometry and automated theorem proving com- 
munities will find it useful as symbolic algebraic techniques have begun to 
play an important role in these areas. 

The main four topics-Grobner bases, characteristic sets, resultants and 
semialgebraic sets-were picked to reflect my original motivation. The choice 
of the topics was partly influenced by the syllabii proposed by the Research 
Institute for Symbolic Computation in Linz, Austria, and the discussions 
in Hcarn's Report ("Future Directions for Research in Symbolic Computa- 
tion"). 

The book is meant to be covered in a one-semester graduate course 
comprising about fifteen lectures. The book assumes very little background 
other than what most beginning computer science graduate students have. 
For these reasons, I have attempted to keep the book self-contained and 
largely focussed on the very basic materials. 

Since 1987, there has been an explosion of new ideas and techniques 
in all the areas covered here (e.g., better complexity analysis of Grobner 
basis algorithms, many new applications, effective Nullstellensatz, multi- 
variate resultants, generalized characteristic polynomial, new stratification 
algorithms for semialgebraic sets, faster quantifier elimination algorithm 
for Tarski sentences, etc.). However, none of these new topics could be in- 
cluded here without distracting from my original intention. It is hoped that 
this book will prepare readers to be able to study these topics on their own. 
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Also, there have been several new textbooks in the area (by Akritas, 
Davenport, Siret and Tournier, and Mignotte) and there are a few more 
on the way (by Eiscnbaud, Robbiano, Weispfenning and Becker, Yap, and 
Zippel). All these books and the current book emphasize different mate- 
rials, involve different degrees of depth and address different readerships. 
An instructor, if he or she so desires, may choose to supplement the cur- 
rent book by some of these other books in order to bring in such topics as 
factorization, number-theoretic or group-theoretic algorithms, integration 
or differential algebra. 

The author is grateful to many of his colleagues at NYU and elsewhere 
for their support, encouragement, help and advice. Namely, J. Canny, E.M. 
Clarke, B. Chazelle, M. Davis, H.M. Edwards, A. Frieze, J. Gutierrez, 
D. Kozcn, R. Pollack, D. Scott, J. Spencer and C-K. Yap. I have also 
benefited from close research collaboration with my colleague C-K. Yap 
and my graduate students G. Gallo and P. Pedersen. Several students in 
my class have helped me in transcribing the original notes and in preparing 
some of the solutions to the exercises: P. Agarwal, G. Gallo, T. Johnson, 
N. Oliver, P. Pedersen, R. Sundar, M. Tcichman and P. Tetali. 

I also thank my editors at Springer for their patience and support. 
During the preparation of this book I had been supported by NSF and 
ONR and I am gratified by the interest of my program officers: Kamal 
Abdali and Ralph Wachter. 

I would like to express my gratitude to Prof. Bill Wulf for his efforts to 
perform miracles on my behalf during many of my personal and professional 
crises. I would also like to thank my colleague Thomas Anantharaman for 
reminding me of the power of intuition and for his friendship. Thanks are 
due to Robin Mahapatra for his constant interest. 

In the first draft of this manuscript, I had thanked my imaginary wife 
for keeping my hypothetical sons out of my nonexistent hair. In the interim 
five years, I have gained a wife Jane and two sons Sam and Tom, necessarily 
in that order-but, alas, no hair. To them, I owe my deepest gratitude for 
their understanding. 

Last but not least, I thank Dick Aynes without whose unkind help this 
book would have gone to press some four years ago. 



B. Mishra 
mishra@nyu.edu. arpa 
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Introduction 



1.1 Prologue: Algebra and Algorithms 

The birth and growth of both algebra and algorithms are strongly inter- 
twined. The origins of both disciplines are usually traced back to Muha- 
mmed ibn-Musa al-Khwarizmi al-Quturbulli, who was a prominent figure 
in the court of Caliph Al-Mamun of the Abassid dynasty in Baghdad (813- 
833 A.D.). Al-Khwarizmi's contribution to Arabic and thus eventually to 
Western (i.e., modern) mathematics is manifold: his was one of the first 
efforts to synthesize Greek axiomatic mathematics with the Hindu algo- 
rithmic mathematics. The results were the popularization of Hindu nu- 
merals, decimal representation, computation with symbols, etc. His tome 
"al- Jabr wal-Muqabala," which was eventually translated into Latin by the 
Englishman Robert of Chester under the title "Dicit Algoritmi," gave rise 
to the terms algebra (a corruption of "al-Jabr") and algorithm (a corrup- 
tion of the word "al-Khwarizmi"). 

However, the two subjects developed at a rather different rate, between 
two different communities. While the discipline of algorithms remained in 
its suspended infancy for years, the subject of algebra grew at a prodigious 
rate, and was soon to dominate most of mathematics. 

The formulation of geometry in an algebraic setup was facilitated by 
the introduction of coordinate geometry by the French mathematician 
Descartes, and algebra caught the attention of the prominent mathemati- 
cians of the era. The late nineteenth century saw the function-theoretic and 
topological approach of Riemann, the more geometric approach of Brill and 
Noether, and the purely algebraic approach of Kronecker, Dedekind and 
Weber. The subject grew richer and deeper, with the work of many illus- 
trious algebraists and algebraic geometers: Newton, Tschirnhausen, Euler, 
Jacobi, Sylvester, Riemann, Cayley, Kronecker, Dedekind, Noether, Cre- 
mona, Bertini, Segre, Castelnuovo, Enriques, Severi, Poincare, Hurwitz, 
Macaulay, Hilbert, Weil, Zariski, Hodge, Artin, Chevally, Kodaira, van der 
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Waerden, Hironaka, Abhyankar, Serre, Grothendieck, Mumford, Griffiths 
and many others. 

But soon algebra also lost its constructive foundation, so prominent in 
the work of Newton, Tschirnhausen, Kronecker and Sylvester, and thereby 
its role as a computational tool. For instance, under Bourbaki's influence, 
it became fashionable to bring into disrepute the beautiful and constructive 
elimination theory, developed over half a century by Sylvester, Kronecker, 
Mertens, Konig, Hurwitz and Macaulay. The revival of the field of con- 
structive algebra is a rather recent phenomenon, and owes a good deal 
to the work of Tarski, Seidenberg, Ritt, Collins, Hironaka, Buchberger, 
Bishop, Richman and others. The views of a constructive algebraist are 
closest to the ones we will take in the book. These views were rather 
succinctly described by Hensel in the preface to Kronecker's lectures on 
number theory: 

[Kronecker] believed that one could, and that one must, in these 
parts of mathematics, frame each definition in such a way that one 
can test in a finite number of steps whether it applies to any given 
quantity. In the same way, a proof of the existence of a quantity 
can only be regarded as fully rigorous when it contains a method by 
which the quantity whose existence is to be proved can actually be 
found. 

The views of constructive algebraists are far from the accepted dogmas 
of modern mathematics. As Harold M. Edwards [68] put it: "Kronecker's 
views are so antithetical to the prevailing views that the natural way for 
most modern mathematicians to describe them is to use the word 'heresy'." 

Now turning to the science of algorithms, we see that although for 
many centuries there was much interest in mechanizing the computation 
process, in the absence of a practical computer, there was no incentive 
to study general-purpose algorithms. In the 1670's, Gottfried Leibnitz in- 
vented his so-called "Leibnitz Wheel," which could add, subtract, multiply 
and divide. On the subject of mechanization of computation, Leibnitz said 
([192], pp. 180-181): 

And now that we may give final praise to the machine we may 
say that it will be desirable to all who are engaged in computa- 
tions... managers of financial affairs, merchants, surveyors, geogra- 
phers, navigators, astronomers.... But limiting ourselves to scientific 
uses, the old geometric and astronomic tables could be corrected and 
new ones constructed.. ..Also, the astronomers surely will not have 
to continue to exercise the patience which is required for computa- 
tion.... For it is unworthy of excellent men to lose hours like slaves in 
the labor of computation. 

Leibnitz also sought a characteristica generalis, a symbolic language, to 
be used in the translation of mathematical methods and statements into 
algorithms and formulas. Many of Leibnitz's other ideas, namely, the bi- 
nary number system, calculus ratiocanator or calculus of reason, and lingua 
characteristica, a universal language for mathematical discourse, were to 
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influence modern-day computers, computation and logical reasoning. The 
basic notions in calculus ratiocanator led to Boolean algebra, which, in 
turn, formed the foundations for logic design, as developed by C. Shannon. 

However, the technology of the time was inadequate for devising a prac- 
tical computer. The best computational device Leibnitz could foresee was 
a "learned committee" sitting around a table and saying: 

"Lasst uns rechnen!" 

In the nineteenth century, Charles Babbage conceived (but never con- 
structed) a powerful calculating machine, which he called an analytical 
engine. The proposed machine was to be an all-purpose automatic de- 
vice, capable of handling problems in algebra and mathematical analysis; 
in fact, of its power, Babbage said that "it could do everything but compose 
country dances." [102] 

Except for these developments and a few others of similar nature, the 
science of computation and algorithms remained mostly neglected in the 
last century. In this century, essentially two events breathed life into these 
subjects: One was the study concerning the foundations of mathematics, 
as established in "Hilbert's program," and this effort resulted in Godel's in- 
completeness theorems, various computational models put forth by Church, 
Turing, Markov and Post, the interrelatedness of these models, the ex- 
istence of a "universal" machine and the problem of computability (the 
Entsheidungsproblem) . The other event was the advent of modern high- 
speed digital computers in the postwar period. During the Second World 
War, the feasibility of a large-scale computing machine was demonstrated 
by Colossus in the U.K. (under M.H.A. Newman) and the ENIAC in the 
U.S.A. (under von Neumann, Eckert and Mauchly). After the war, a large 
number of more and more powerful digital computers were developed, start- 
ing with the design of EDVAC in the U.S.A. and Pilot ACE and DEDUCE 
in the U.K. 

Initially, the problems handled by these machines were purely numerical 
in nature, but soon it was realized that these computers could manipulate 
and compute with purely symbolic objects. It is amusing to observe that 
this had not escaped one of the earliest "computer scientists," Lady Ada 
Augusta, Countess Lovelace. She wrote [102], while describing the capa- 
bilities of Babbage's analytical engine, 

Many persons who are not conversant with mathematical studies 
imagine that because the business of [Babbage's analytical engine] 
is to give its results in numerical notation, the nature of its process 
must consequently be arithmetical rather than algebraic and ana- 
lytical. This is an error. The engine can arrange and combine its 
numerical quantities exactly as if they were letters or any other gen- 
eral symbols; and, in fact, it might bring out its results in algebraic 
notation were provisions made accordingly. 
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The next major step was the creation of general-purpose programming 
languages in various forms: as instructions, introduced by Post; as produc- 
tions, independently introduced by Chomsky and Backus; and as functions, 
as introduced by Church in A-calculus. This was quickly followed by the de- 
velopment of more powerful list processing languages by Newell and Simon 
of Carnegie-Mellon University, and later the language Lisp by McCarthy 
at M.I.T. The language Lisp played a key role in the rapid development of 
the subjects of artificial intelligence (AI) and symbolic mathematical com- 
putation. In 1953, some of the very first symbolic computational systems 
were developed by Nolan of M.I.T. and Kahrimanian of Temple University. 

In parallel, the science of design and complexity analysis of discrete 
combinatorial algorithms has grown at an unprecedented rate in the last 
three decades, influenced by the works of Dijkstra, Knuth, Scott, Floyd, 
Hoare, Minsky, Rabin, Cook, Hopcroft, Karp, Tarjan, Hartmanis, Stern, 
Davis, Schwartz, Pippenger, Blum, Aho, Ullman, Yao and others. Other 
areas such as computational geometry, computational number theory, etc. 
have emerged in recent times, and have enriched the subject of algorithms. 
The field of computational algebra and algebraic geometry is a relative 
newcomer, but holds the promise of adding a new dimension to the subject 
of algorithms. 

After a millennium, it appears that the subjects of algorithms and alge- 
bra may finally converge and coexist in a fruitful symbiosis. We conclude 
this section with the following quote from Edwards [68]: 

I believe that Kronecker's best hope of survival comes from a different 
tendency in the mathematics of our day...., namely, the tendency, 
fostered by the advent of computers, toward algorithmic thinking.... 
One has to ask oneself which examples can be tested on a computer, a 
question which forces one to consider concrete algorithms and to try 
to make them efficient. Because of this and because algorithms have 
real-life applications of considerable importance, the development of 
algorithms has become a respectable topic in its own right. 

1.2 Motivations 

What happened to Hilbert's man in the street? 

— Shreeram S. Abhyankar 

There are essentially four groups of people, who have been instrumental in 
the rapid growth of the subject of "algorithmic algebra." Although, in some 
sense, all of the four groups are working toward a common goal, namely, 
that of developing an algorithmic (read, constructive) foundation for vari- 
ous problems in algebra, their motivations differ slightly from one another. 
The distinction is, however, somewhat artificial, and a considerable overlap 
among these communities is ultimately unavoidable. 
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1.2.1 Constructive Algebra 

One of the main issues that concerns the constructive algebraists is that 
of the philosophical foundations of mathematics. We have alluded to this 
issue in the introductory section, and will refer to this as "the theological 
issue." 

During the last century, the movement of "analysis" toward noncon- 
structive concepts and methods of proof had a considerable ideological 
impact on traditionally constructive areas such as algebra and number the- 
ory. In this context, there were needs for a revision of what was under- 
stood by the "foundations of mathematics." Some mathematicians of the 
time, most prominently Kronecker, attacked the emergent style of non- 
constructivity and defended the traditional views of foundations espoused 
by their predecessors. However, to most mathematicians of the time, the 
constraints imposed by constructivity appeared needlessly shackling. It 
was historically inevitable that the nonconstructivity implied in the Canto- 
rian/Weirstrassian view of the foundation of mathematics would dominate. 
Indeed, Dedekind, a student of Kronecker and a prominent algebraist on 
his own, "insisted it was unnecessary — and he implied it was undesirable — 
to provide an algorithmic description of an ideal, that is, a computation 
which would allow one to determine whether a given ring element was or 
was not in the ideal." [67] Kronecker's view, on the other hand, can be sur- 
mised from the following excerpts from Edwards' paper on "Kronecker's 
Views on the Foundations of Mathematics" [67] : 

Kronecker believed God made the natural numbers and all the rest 
was man's work. We only know of this opinion by hearsay evidence, 
however, and his paper Ueber den Zahlbegriff indicates to me that he 
thought God made a bit more: Buchstabenrechnung , or calculation 
with letters. In modern terms, Kronecker seems to envisage a cosmic 
computer which computes not just with natural numbers, but with 
polynomials with natural number coefficients (in any number of in- 
determinates) . That's the God-given hardware. The man-made soft- 
ware then creates negative numbers, fractions, algebraic irrationals, 
and goes on from there. Kronecker believed that such a computer, 
in the hands of an able enough programmer, was adequate for all the 
purposes of higher mathematics. 

A little further on, Edwards summarizes Kronecker's views as follows: 
"Kronecker believed that a mathematical concept was not well defined un- 
til you had shown how, in each specific instance, to decide [algorithmically] 
whether the definition was fulfilled or not." 

Having said this, let us use the following anecdote to illustrate the de- 
bates of the time regarding the foundations of mathematics. This concerns 
the seminal nonconstructive argument of Hilbcrt (Hilbert's basis theorem) 
that every ideal in the ring of polynomials in several variables over a field is 
finitely generated. In applying this theorem to Gordon's problem of finding 
a finite set of generators for certain rings of invariant forms, Hilbert reduced 
this problem to that of finding finite sets of generators for certain ideals. 
As the rings and associated ideals are described in a finite way, Gordon 
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expected an explicit description of the generators. Gordon had been able 
to solve his problems for two variables in a constructive manner, and was 
not happy with Hilbert's solution. Gordon dismissed Hilbert's solution as 
follows: 

"Das ist nicht Mathematik. Das ist Theologie." 
Hilbert was able to return to the original problem to give a satisfactory 
construction. We will discuss this particular problem in greater detail. 

A more clear and concrete view regarding constructivity appears to have 
emerged only very recently. According to this view, the constructive alge- 
bra differs significantly from the classical mathematics by its interpretation 
of "existence of an object." "In the classical interpretation, an object exists 
if its nonexistence is contradictory. There is a clear distinction between this 
meaning of existence and the constructive, algorithmic one, under which an 
object exists only if we can construct it, at least in principle. As Bishop has 
said, such 'meaningful distinctions deserve to be maintained'. [23]" One can 
further restrict what one means by the word "construction." According to 
G. Hermann, "the assertion that a computation can be carried through in 
a finite number of steps shall mean that an upper bound for the number 
of operations needed for the computation can be given. Thus, it does not 
suffice, for example, to give a procedure for which one can theoretically 
verify that it leads to the goal in a finite number of operations, so long as 
no upper bound for the number of operations is known." 

There are other motivation for studying constructive algebra: it adds 
depth and richness to classical algebra. For instance, given the latitude 
one has in specifying ideals, Hilbert's proof of the basis theorem had to 
be nonconstructive — thus, in a constructive setting, one is led to explore a 
much finer structure (such as Noetherianness, coherence) of the underlying 
polynomial ring in order to provide a satisfactory answer. 

And, of course, this provides a stepping stone for theoretical computer 
scientists to study the design and implementation of efficient algorithms. 
Once we understand what algebraic objects are amenable to constructive 
treatment, we can study how we can improve the associated algorithms 
and how these objects can be used to solve important practical problems. 

1.2.2 Algorithmic and Computational Algebra 

A prominent algebraic geometer advocating the algorithmic view point is 
Abhyankar. In his paper "Historical Rambling in Algebraic Geometry," 
Abhyankar [2] categorizes algebraic geometry into three classes (roughly, 
in terms of their algorithmic contents): "high school algebra" (Newton, 
Tschirnhausen, Euler, Sylvester, Cayley, Kronecker, Macaulay), "college 
algebra" (Dedekind, Noethcr, Krull, Zariski, Chevally, Cohen) and "uni- 
versity algebra" (Serre, Cartan, Eilenberg, Grothcndicck, Mumford), and 
calls for a return to the algorithmic "high school algebra" : 

The method of high-school algebra is powerful, beautiful and acces- 
sible. So let us not be overwhelmed by the groups-rings-fields or the 
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functorial arrows of [college or university] algebras and thereby lose 
sight of the power of the explicit algorithmic processes given to us 
by Newton, Tschirnhausen, Kronecker and Sylvester. 

The theoretical computer scientists take Abhyankar's viewpoint to the 
extreme: they regard the existence of a construction as only a first step 
toward a precise classification of the inherent computational complexity of 
an algebraic problem. A theoretical computer scientist would be concerned 
with questions of the following kinds: 

• What are the resource complexities associated with an algebraic prob- 
lem? Is a certain set of algebraic problems intcrreducible to one an- 
other, thus making it sufficient to look for an efficient solution to any 
one of the problems in the class? That is, are there classes of alge- 
braic problems that are isomorphic to one another in terms of their 
resource requirements? (Note that as algebraic problems, they may 
be addressing rather unrelated questions.) 

• Is a particular problem computationally feasible? If not, are there 
restrictive specializations that can be made feasible? Can random- 
ization help? 

• How does the problem depend on various models of computation? 
Can the problem be easily parallelized? Can preprocessing, or pre- 
conditioning, help? 

• What is the inherent complexity of the problem? Given an algorithm 
for a problem, can we say whether it is the best possible solution in 
terms of a particular resource complexity? 

• What are the basic ingredients required to translate these algorithms 
to usable implementations? For instance, how are numbers to be rep- 
resented: in finite precision, or in infinite precision (algebraic num- 
ber)? How are algebraic numbers to be stored internally: in terms 
of an algorithm, or by its minimal polynomial and a straddling inter- 
val? What kind of data structures are most suitable to a particular 
problem? 

1.2.3 Symbolic Computation 

In 1953, the first modern computer programs to perform symbolic compu- 
tation were realized in two master's theses: one by H.G. Kahrimanian at 
Temple University [108] and another by J.F. Nolan at the Massachusetts 
Institute of Technology [157]. The differentiation program developed by 
Kahrimanian for UNIVAC I took as its input an expression represented 
as a linearized binary tree and produced the derivative of the expression. 

After the development of the Lisp language by McCarthy, the prob- 
lem of developing symbolic mathematical systems became relatively easy. 
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James Slagle (a student of Minsky) developed an integration program called 
SAINT in Lisp in 1962. The program was rudimentary and lacked a strong 
mathematical foundation, but was still able to perform at the level of a 
freshman calculus student. 

During the early sixties, the next important step was the development 
of general-purpose systems aimed at making computerized mathematical 
computation accessible to laymen. Notable among such developments: AL- 
PAK [27] and ALTRAN [25] at Bell Laboratories by a group headed by 
W.S. Brown, and FORMAC [181] at I.B.M. under the guidance of J.E. 
Sammct. FORMAC was somewhat limited in scope in comparison to 
ALPAK and ALTRAN, since it dealt exclusively with polynomial and 
rational functions. 

Around the same time, G. Collins of the University of Wisconsin had 
been developing PM [48] , a polynomial manipulation system, which utilized 
an efficient canonical recursive representation of polynomials and supported 
arbitrary precision arithmetic. The PM system was later supplanted by 
SAC-1 [49], which could perform operations on multivariate polynomials 
and rational functions with infinite precision coefficients. The algorithms 
in SAC-1 were based on the decision procedure invented by Tarski, Sei- 
denberg and Cohen for the elementary theory of a real closed field. These 
algorithms have widespread applications in various areas of computer sci- 
ence and robotics, and will be discussed at length in this book. An improved 
version of SAC-1, called SAC-2 [36] and written in an algebraic language 
Aldes, succeeded the older system. 

Staring in the late sixties, the focus shifted to the development of 
symbolic manipulation systems that allowed a more natural interactive 
usage. The significant systems in this category included: Engleman's 
MATHLAB-68 developed at M.I.T. [69], Tony Hearn's REDUCE-2 de- 
veloped at Rand and University of Utah [165], Barton, Bourne and Fitch's 
CAMAL system (CAMbridge ALgebra system) [71], Moses and Martin's 
MACSYMA developed under the MAC project at M.I.T. [88], Griesmer 
and Jcnks's SCRATCHPAD system developed at I.B.M. [106] and more 
recently Jenks and Sutor's AXIOM system that evolved from SCRATCH- 
PAD [107]. 

While a detailed comparison of these systems would be fairly hard, we 
note that they differ from one another in their design goals. MATHLAB- 
68 is a general-purpose system, designed to perform differentiation, polyno- 
mial factorization, indefinite integration, direct and inverse Laplace trans- 
forms and the solution of differential equations with symbolic coefficients. 
REDUCE is a general-purpose software system with built-in algebraic sim- 
plification mechanisms, and thus it allows a user to build his own programs 
to solve "superdifficult" problems [165] with relative ease; this system has 
been successfully used to solve problems in QED, QCD, celestial mechanics, 
fluid mechanics, general relativity, plasma physics and various engineering 
disciplines. CAMAL is a small, fast, powerful and yet general-purpose 
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system consisting of three modules: F-modulc for Fourier series, E-modulc 
for complex exponential series and H-modulc (the "Hump"), a general- 
purpose package. In comparison to the above systems, both MACSYMA 
and SCRATCHPAD systems are "giants" and are designed to incorpo- 
rate all the state-of-the-art techniques in symbolic algebra and software 
engineering. 

The number of algebraic systems has grown at a tremendous rate in 
the recent past. An estimate given by Pavelle, Rothstcin and Fitch is 
that in the last thirty years, about sixty systems have been developed for 
doing some form of computer algebra. The more notable ones among these 
are SMP, developed by Cole and Wolfram at CalTech and the Institute 
for Advanced Studies, MAPLE, developed at the University of Waterloo, 
Bergman's PROLOG-based SYCOPHANTE system, Engeli's SYMBAL 
system, Rich and Stoutemyr's muMATH system for I.B.M. PC's and Jenks 
and Sutors's SCRATCHPAD/AXIOM system. 

In the last few years, the general-purpose computer algebra system 
MATHEMATICA [209] developed by Wolfram Research, Inc., and run- 
ning on several personal computers (including Macintosh II and NeXT 
computers) has brought symbolic computation to the domain of everyday 
users. Other notable recent systems with similar interfaces and achieve- 
ments include MAPLE and SCRATCHPAD/AXIOM. It is hoped these 
systems will influence, to a substantial degree, the computing, reasoning 
and teaching of mathematics [186]. 

The main goal of the researchers in this community has been to develop 
algorithms that are efficient in practice. Other related issues that concern 
this group involve developing languages ideal for symbolic computation, 
easy-to-use user interfaces, graphical display of various algebraic objects 
(i.e., algebraic curves, surfaces, etc.), and computer architecture best suited 
for symbolic manipulation. 

1.2.4 Applications 

The last motivation for the study of computational algebra comes from its 
wide variety of applications in biology (e.g., secondary structure of RNA), 
chemistry (e.g., the nature of equilibria in a chemical process), physics 
(e.g., evaluation of Feynman diagrams), mathematics (e.g., proof of the 
Macdonald-Morris conjecture), computer science (e.g., design of the IEEE 
standard arithmetic) and robotics (e.g., inverse kinematic solution of a mul- 
tilinked robot). Some of the major applications of symbolic computational 
algebra in various subareas of computer science are summarized as follows: 

1. Robotics: Most of the applications of computational algebra in 
robotics stem from the algebraico-geometric nature of robot kinemat- 
ics. Important problems in this area include the kinematic modeling 
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of a robot, the inverse kinematic solution for a robot, the computation 
of the workspace and workspace singularities of a robot, the planning 
of an obstacle-avoiding motion of a robot in a cluttered environment, 
etc. 

2. Vision: Most of the applications here involve the representation of 
various surfaces (usually by simpler triangulated surfaces or general- 
ized cones), the classification of various algebraic surfaces, the alge- 
braic or geometric invariants associated with a surface, the effect of 
various affinc or projective transformation of a surface, the descrip- 
tion of surface boundaries, etc. 

3. Computer- Aided Design (CAD): Almost all applications of CAD 
involve the description of surfaces, the generation of various auxil- 
iary surfaces such as blending surfaces, smoothing surfaces, etc., the 
parametrization of curves and surfaces, various Boolean operations 
such as union and intersection of surfaces, etc. 

Other applications include graphical editors, automated (geometric) 
theorem proving, computational algebraic number theory, coding theory, 
etc. 

To give an example of the nature of the solution demanded by various 
applications, we will discuss a few representative problems from robotics, 
engineering and computer science. 

Robot Motion Planning 

• GIVEN: The initial and final (desired) configurations of a robot (made 
of rigid subparts) in two- or three-dimensional space. 

The description of stationary obstacles in the space. 

The obstacles and the subparts of the robot are assumed to be rep- 
resented as the finite union and intersection of algebraic surfaces. 

• Find: Whether there is a continuous motion of the robot from the 
initial configuration to the final configuration. 

The solution proceeds in several steps. The first main step involves 
translating the problem to a parameter space, called the C-space. The 
C-space (also called configuration space) is simply the space of all points 
corresponding to all possible configurations of the robot. 

The C-space is usually a low-dimensional (with the same dimension as 
the number of degrees of freedom of the robot) algebraic manifold lying in 
a possibly higher-dimensional Euclidean space. The description and com- 
putation of the C-space are interesting problems in computational algebra, 
and have been intensely studied. 

The second step involves classifying the points of the C-space into two 
classes: 
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• Forbidden Points: A point of C-space is forbidden if the corresponding 
configuration of the robot in the physical space would result in the 
collision of two subparts of the robot and/or a subpart of the robot 
with an obstacle. 

• Free Points: A point of C-space that is not forbidden is called a free 
point. It corresponds to a legal configuration of the robot amidst the 
obstacles. 

The description and computation of the free C-space and its (path) connect- 
ed components are again important problems in computational algebra, 
perhaps not dissimilar to the previous problems. Sometimes the free space 
is represented by a stratification or a decomposition, and we will have to 
do extra work to determine the connectivity properties. 

Since the initial and final configurations correspond to two points in the 
C-space, in order to solve the motion planning problem, we simply have to 
test whether they lie in the same connected component of the free space. 
This involves computing the adjacency relations among various strata of the 
free space and representing them in a combinatorial structure, appropriate 
for fast search algorithms in a computer. 

Offset Surface Construction in Solid Modeling 

• GIVEN: A polynomial f(x 7 y, z), implicitly describing an algebraic 
surface in the three-dimensional space. That is, the surface consists 
of the following set of points: 



• Compute: The envelope of a family of spheres of radius r whose 
centers lie on the surface /. Such a surface is called a (two-sided) 
offset surface of /, and describes the set of points at a distance r on 
both sides of /. 

First we need to write down a set of equations describing the points on 
the offset surface. Let p = (x, y, z) be a point on the offset surface and 
q = (u, v, w) be a footprint of p on /; that is, q is the point at which a 
normal from p to f meets /. Let t\ = (ti,i, ii.2, £1,3) and ti = {t2.i1 ^2.2, 
£2,3) be two linearly independent tangent vectors to / at the point q. Then, 
we see that the offset surface is given by: 





= 
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A (x — u, y — v, z — w) ■ t\ = 
A (x — u,y — v, z — w) ■ t 2 = 



Thus the system of polynomial equations given below 



(x - u) 2 + (y- v) 2 + (z- w) 2 - r 2 

f(u,v,w) 



o. 
o. 

0. 



(1.1) 

(1.2) 
(1.3) 
(1.4) 



(a; - u)ti ; i + (y - v)t ia + (z - w)t lt3 
(x - u)t 2 ,i + (y- v)t 2 ,2 + {z- W)t 2 ,3 







describes a hypersurfacc in the six-dimensional space with coordinates 
(x, y, z, u, v, w), which, when projected onto the three-dimensional space 
with coordinates (x, y, z), gives the offset surface in an implicit form. The 
offset surface is computed by simply eliminating the variables u, v, w from 
the preceding set of equations. Note that equation (1.1) states that the 
point (x, y, z) on the offset surface is at a distance r from its footprint 
(u, v, w); the last three equations (1.2), (1.3), (1.4) ensure that (u, v, w) 
is, indeed, a footprint of (x, y, z). 

The envelope method of computing the offset surface has several prob- 
lematic features: The method does not deal with self-intersection in a clean 
way and, sometimes, generates additional points not on the offset surface. 
For a discussion of these problems, and their causes, see the book by CM. 
Hoffmann [99]. 

Geometric Theorem Proving 

• Given: A geometric statement, consisting of a finite set of hypotheses 
and a conclusion. It is assumed that the geometric predicates in the 
hypotheses and the conclusion have been translated into an analytic 
setting, by first assigning symbolic coordinates to the points and then 
using the polynomial identities (involving only equalities) to describe 
the geometric relations: 



• Decide: Whether the conclusion g = is a consequence of the hy- 
potheses (/i = OA •••A f r = 0). That is, whether the following 
universally quantified first-order formula holds: 



Hypotheses 
Conclusion 



f l {x ll ...,x n ) =0,...,f r (x 1 ,...,x n ) =0. 
g(x!, ...,x n )=Q. 




(1.5) 
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One way to solve the problem is by first translating it into the follow- 
ing form: Decide if the existentially quantified first-order formula, shown 
below, is unsatisfiable: 



The logical equivalence of the formulas (1.5) and (1.6), when the underlying 
domain is assumed to be a field, is fairly obvious. (Reader, please convince 
yourself.) 

However, the nature of the solutions may rely on different techniques, 
depending on what we assume about the underlying fields: For instance, if 
the underlying domain is assumed to be the field of real numbers (a real 
closed field) , then we may simply check whether the following multivariate 
polynomial (in xi, . . ., x n , z) has no real root: 



If, on the other hand, the underlying domain is assumed to be the 
field of complex numbers (an algebraically closed field), then we need to 
check if it is possible to express 1 as a linear combination (with polynomial 
coefficients) of the polynomials /i, f r and (gz — 1), i.e., whether 1 
belongs to the ideal generated by /i, . . . , f r , (gz — 1). Another equivalent 
formulation of the problem simply asks if g is in the radical of the ideal 
generated by /i, . . . , f r . The correctness of these techniques follow via 
Hilbert's Nullstellensatz. 

Later on in the book, we shall discuss, in detail, the algebraic problems 
arising in both situations. (See Chapters 4 and 8.) 

1.3 Algorithmic Notations 

As our main goal will be to examine effective algorithms for computing with 
various algebraic structures, we need a clear and unambiguous language for 
describing these algorithms. In many cases, a step-by-step description of 
algorithms in English will be adequate. But we prefer to present these 
algorithms in a fairly high-level, well-structured computer language that 
will borrow several concepts from ALGOL [206] and SETL [184]. Occa- 
sionally, we will allow ourselves to describe some of the constituent steps, 
in a language combining English, set theory and mathematical logic. 

1.3.1 Data Structures 

The primitive objects of our language will consist of simple algebraic ob- 
jects such as Booleans, groups, rings, fields, etc., with their associated 
algebraic operations. For instance, we may assume that the language pro- 
vides mechanisms to represent real numbers, and supports operations such 




fi = A • • • A f r = A gz - 1 = . 



(1.6) 



A 2 + • • • + fr + {gz - if. 
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as addition, subtraction, multiplication and division. We shall assume that 
each of these algebraic operations can be performed "effectively," in the 
sense that the operation produces the correct result in a finite amount of 
time. We shall also regard an interval as a primitive: an interval [j..k] is a 
sequence of integers j, j + 1, . . ., k, if j < k, and an empty sequence other- 
wise. The notation i £ [j.-k] (read, "i belongs to the interval [j. .£;]") means 
i is an integer such that j < i < k. Occasionally, we shall also use the 
notation [j,k..l] (j ^ k) to represent the following arithmetic progression 
of integers: j , j + (k - j), j + 2(k - j), . . ., j + [(l - j)/(k - j)\(k - j). The 
notation i £ [j, k..l] (read, "i belongs to the arithmetic progression [j, k..l] n ) 
means that i = j + a(k — j), for some integer < a < [(I — j)/(k — j)\ . 

The main composite objects in the language are tuples and sets. An 
ordered n-tuple T = (xi, X2, • ■ •, x n ) is an ordered sequence of n elements 
(primitive or composite), some of which may be repeated. The size of the 
tuple T is denoted by \T\, and gives the number of elements in T. The 
empty tuple is denoted by ( ). The i th element of an n-tuplc T (1 < i < n) 
is denoted by T[i). A (j — i + 1) subtuplc of an n-tuple T = (xi, x 2 , ■ ■ ., 
Xn) (1 < i < j < n), consisting of elements Xi through Xj, is denoted by 
T[i,j]. Note that T[i,i] is a 1-tuple (xj), whereas T[i] is simply the i th 
element of T, Xj. Given an m-tuple T\ = (xi t \, Xi ; 2, . . ., Xi >m ) and an 

71-tuple ?2 — (^2,1, X2,2; • • -^2,n 

), their concatenation, T\ 0T2, denotes an 
(to + n)-tuple (xi ; i, xi ;2 , . . ., xi >m , x 2 ,i, x 2 , 2 , • ■ ■, x 2 ,„>. 

We can also represent arbitrary insertion and deletion on tuples by 
combining the primitive operations subtuples and concatenation. Let T be 
a tuple and x an arbitrary element. Then 

Head(T) = return T[l] 

Tail(T) = return T[\T\] 

Push(2:,r) = return (a;) o T 

Pop(T) ee return T[2..\T\] 

Inject (x,T) ee return To (a;) 

Eject(T) ee return T[1..\T\ - 1] 

Using these operations, we can implement stack (with head, push and pop), 
queue (with head, inject and pop) or a deque (with head, tail, push, pop, 
inject and eject). 

A set S — {xi, x 2 , . . ., x n } is a finite collection of n distinct elements 
(primitive or composite). The size of the set S is denoted by \S\, and gives 
the number of elements in S. The empty set is denoted by (or, sometimes, 
{ }). The operation Choose(S) returns some arbitrary element of the set 
S. The main operations on the sets are set- union U, set-intersection ft 
and set-difference \: If S\ and 52 are two sets, then 5i U S2 yields a set 
consisting of the elements in Si or 5*2 , Si fl S2 yields a set consisting of the 
elements in Si and 5*2 , and Si \ S2 yields a set consisting of the elements 
in S*i but not in S 2 . 

We can also represent arbitrary insertion and deletion on sets by com- 
bining the primitive set operations. Let 5 be a set and x an arbitrary 
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element. Then 



Insert (x, S) 
Deleters) 



return S U {x} 
return S \ {x} 



1.3.2 Control Structures 

A program consists of a sequence of statements, the most basic operation 
being the assignment. The symbol := denotes the assignment and the 
symbol ; the sequencer or the statement separator. Thus the assignment 
statement, X, :— expression first evaluates the expression in the right-hand 
side, then deposits the value of the expression in the location corresponding 
to the variable X; in the left-hand side. We also write 



to denote the parallel assignment of the values of the components of the 
n-tuple of expressions in the right-hand side, in the locations corresponding 
to the ro-tuple of variables (xi, . . . ,x n ) in the left-hand side. Interesting 
examples of such parallel assignments are the following: 

(x,y) := (y,x) 

swaps the values of the variables x and y; 

(xi, . . . , Xj_j+i) := (expression!, . . . , expression n )[i..j] 

selects the values of the expressions i through j . 

In a program, a Boolean expression corresponds to a propositional state- 
ment consisting of atomic predicates, and the connectives or, and and 
not. We also use the connectives cor (conditional or) and cand (con- 
ditional and) with the following semantics: in " Boolean conditioni cor 
Boolean conditioni" the second Boolean condition is evaluated, only if 
the first condition evaluates to "false;" and in "Boolean conditioni cand 
Boolean condition2," the second Boolean condition is evaluated, only if the 
first condition evaluates to "true." 

We use three main control structures: 

If-Then-Else: 



(xi, . . . , x n ) := (expression!, . . . , expression^) 



if Boolean conditioni then 

statement i 
elsif Boolean condition^ then 

statement^ 



else statement 



71 



end{if } 
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The effect of this statement is to cause the following execution: First, the 
Boolean conditions, Boolean conditiom, Boolean condition?., . . ., are evalu- 
ated sequentially until a "true" Boolean condition is encountered, at which 
point, the corresponding statement is executed. If all the Boolean condi- 
tions evaluate to "false," then the last statement, statement n , is executed. 

Loop: The loop statements appear in two flavors: 

while Boolean condition loop 

statement 
end{loop } 

The effect of this statement to cause the following execution: First, the 
Boolean condition is evaluated, and if it evaluates to "true," then the state- 
ment is executed. At the end of the statement execution, the control passes 
back to the beginning of the loop and this process is repeated as long as the 
Boolean condition continues to evaluate to "true;" if the Boolean condition 
evaluates to "false," then the control passes to the next statement. 

loop 

statement 
until Boolean condition 
end{loop } 

The effect of this statement to cause the following execution: First, the 
statement is executed. At the end of the statement execution, the Boolean 
condition is evaluated. If it evaluates to "false," then the control passes 
back to the beginning of the loop and the process is repeated; if the Boolean 
condition evaluates to "true," then the control passes to the next statement. 

For-Loop: Generally, the for-loop statements appear in the following 
form: 

for every iterator value loop 

statement 
end{loop } 

The effect of a for-loop statement is to cause the statement to be evaluated 
once for each value of the iterator. An iterator may appear in one of the 
following forms: 

1. 11 i G \j-k)" the statement is evaluated k — j + 1 times once for each 
value of i (in the order, j, j + 1, ■ . ., fc); 

2. "i e [j, fc.J]," the statement is evaluated [(I— j)/(k— j)\ +1 times once 
for each value of i (in the order j , k, . . ., j + [(I — j)/(k — j)\ (k — j)); 
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3. "x £ T," where T is a tuple, the statement is evaluated \T\ times 
once for each value of x in T, according to the order imposed by T; 
and 

4. "x £ S," where S is a set, the statement is evaluated | ^S' | times once 
for each value of x in S, in some arbitrary order. 

A program will be organized as a set of named modules. Each module 
will be presented with its input and output specifications. The modules 
can call each other in mutual-recursive or self-recursive fashion; a module 
calls another module or itself by invoking the name of the called module 
and passing a set of parameters by value. When a called module completes 
its execution, it either returns a value or simply, passes the control back to 
the calling module. For each module, we shall need to prove its correctness 
and termination properties. 

As an example of the usage of the notations developed in this section, let 
us examine the following algorithm of Euclid to compute the GCD (greatest 
common divisor) of two positive integers X and Y. In the program the 
function Remaindcr(A", Y) is assumed to produce the remainder, when Y 
is divided by X. 

GCD(X, Y) 

Input: Two positive integers X and Y. 

Output: The greatest common divisor of X and Y, i.e., a positive 
integer that divides both X and Y and is divisible 
by every divisor of both X and Y. 

if X > Y then 

(X,Y) := (Y,X) 
end{if }; 

while X does not divide Y loop 

(X,Y) := (Remainder(X,T),X) 
end{loop }; 

return X; 
endfGCD} □ 

Theorem 1.3.1 The program GCD correctly computes the greatest com- 
mon divisor of two positive integers. 
PROOF. 

Let (Xq,Y ) be the input pair, and (Xi,Yi), (X 2 ,Y 2 ), . . ., (X n ,Y n ) be 
the values of X and Y at each invocation of the while-loop. Since X > 
X\ > ■ ■ ■ X n , and since they are all positive integers, the program must 
terminate. 

Furthermore, for all < i < n, every divisor of Xi and Yi is also a 
divisor X i+ i, and every divisor of X i+ \ and Y i+1 is also a divisor of Yi. 
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Hence, 

GCD(X ,Y ) = GCB(X 1 ,Y 1 ) = ■■■ = GCB(X n ,Y n ). 

But since GCD(X n , Y n ) is clearly X n , the value returned by the program, 
X n , is the greatest common divisor of X and Y. □ 



1.4 Epilogue 

We conclude this chapter with the following poem by Abhyankar, which 
succinctly captures a new spirit of constructiveness in algebra: 

Polynomials and Power Series, 
May They Forever Rule the World 

Shreeram S. Abhyankar 

Polynomials and power series. 

May they forever rule the world. 

Eliminate, eliminate, eliminate. 

Eliminate the eliminators of elimination theory. 

As you must resist the superbourbaki coup, 

so must you fight the little bourbakis too. 

Kronecker, Kronecker, Kronecker above all 

Kronecker, Mertens, Macaulay, and Sylvester. 

Not the theology of Hilbert, 
But the constructions of Gordon. 

Not the surface of Riemann, 
But the algorithm of Jacobi. 

Ah! the beauty of the identity of Rogers and Ramanujan! 
Can it be surpassed by Dirichlet and his principle? 

Germs, viruses, fungi, and functors, 

Stacks and sheaves of the lot 

Fear them not 

We shall be victors. 

Come ye forward who dare present a functor, 
We shall eliminate you 

By resultants, discriminants, circulants and alternants. 
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Given to us by Kronecker, Mertens, Sylvester. 

Let not here enter the omologists, homologists, 

And their cohorts the cohomologists crystalline 
For this ground is sacred. 

Onward Soldiers! defend your fortress, 

Fight the Tor with a determinant long and tall, 

But shun the Ext above all. 

Morphic injectives, toxic projectives, 
Etal, eclat, devious devisage, 

Arrows poisonous large and small 

May the armor of Tschirnhausen 

Protect us from the scourge of them all. 

You cannot conquer us with rings of Chow 

And shrieks of Chern 
For we, too, are armed with polygons of Newton 

And algorithms of Perron. 
To arms, to arms, fractions, continued or not, 

Fear not the scheming ghost of Grothendieck 
For the power of power series is with you, 

May they converge or not 

(May they be polynomials or not) 

(May they terminate or not). 

Can the followers of G by mere "smooth" talk 
Ever make the singularity simple? 
Long live Karl Weierstrass! 

What need have we for rings Japanese, excellent or bad, 
When, in person, Nagata himself is on our side. 

What need to tensorize 

When you can uniformize, 
What need to homologize 

When you can desingularize 

(Is Hironaka on our side?) 

Alas! Princeton and fair Harvard you, too, 

Reduced to satellite in the Bur-Paris zoo. 
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book. Other useful expository materials in this area include the books by Bartels 
et al. [14], Farin [70], Mantyla [138], Mortenson [155], Su and Liu [196], and the 
survey papers by Requicha and co-worker [171, 172]. 

For additional discussion on the subject of geometric theorem proving and 
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its relation to computational algebra, we refer the readers to the works of Tarski 
[200], Chou [46, 47], Davis and Cerutti [60], Gelernter et al. [80], Kapur [113], 
Ko and Hussain [117], Kutzler and Stifter [122], Scott [186] and Wu [209-211]. 

The poem by Abhyankar in the Epilogue was written in August 1970 during 
the International Conference in Nice, and was inspired by van der Waerden's 
historical lecture on the development of algebraic topology. 
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puter algebra syllabus developed at the Research Institute for Symbolic Compu- 
tation at Johannes Kepler University, Linz, Austria (Appendix B, [22]): computer 
algebra I (algorithms in basic algebraic domain), computer algebra II (advanced 
topics, e.g., algorithmic polynomial ideal theory) and parts of computational ge- 
ometry II (algebraic algorithms in geometry). All our algorithms, however, will 
be presented without any analysis of their computational complexity, although, 
for each of the algorithms, we shall demonstrate their termination properties. 
There are quite a few textbooks available in this area, and the reader is urged to 
supplement this book with the following: the books by Akritas [4] , Davenport et 
al. [58], Lipson [132], Mignotte [145], Sims [191], Stauffer et al. [194], Yap [213], 
Zimmer [217], Zippel [218] and the mongraph edited by Buchberger et al. [34]. 

There are several journals devoted to computational algebra and its applica- 
tions; notable among these are Journal of Symbolic Computation, started in 1985, 
and Applicable Algebra in Engineering, Communication and Computer Science, 
started in 1990. Other important outlets for papers in this area are the SIAM 
Journal on Computing and the ACM Transactions on Mathematical Software. 
There are several professional societies, coordinating the research activities in this 
area: ACM SIGSAM (the Association for Computing Machinery Special Interest 
Group on Symbolic and Algebraic Manipulation), SAME (Symbolic and Algebraic 
Manipulation in Europe) and ISSAC (International Symposium on Symbolic and 
Algebraic Computation). Other societies, such as AMS (American Mathematical 
Society), AAAS (American Association for the Advancement of Science), ACS 
(American Chemical Society), APS (American Physical Society) and IEEE (The 
Institute of Electrical and Electronics Engineers), also cover topics in computer 
algebra. 
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Algebraic Preliminaries 



2.1 Introduction to Rings and Ideals 

In this chapter, we introduce some of the key concepts from commutative 
algebra. Our focus will be on the concepts of rings, ideals and modules, 
as they are going to play a very important role in the development of the 
algebraic algorithms of the later chapters. In particular, we develop the 
ideas leading to the definition of a basis of an ideal, a proof of Hilbert's basis 
theorem, and the definition of a Grobner basis of an ideal in a polynomial 
ring. Another important concept, to be developed, is that of a syzygy of a 
finitely generated module. 

First, we recall the definition of a group: 

Definition 2.1.1 (Group) A group G is a nonempty set with a binary 
operation {product, •) such that 

1. G is closed under the product operation. 

a-beG . 

2. The product operation is associative. That is, 

(a • b) ■ c = a ■ (b ■ c) . 

3. There exists (at least) one element e e G, called the (left) identity, 
so that 

e ■ a = a . 

4. Every element of G has a (left) inverse: 

a -1 ■ a = e . 



VoieG 



( V a,b,ce G) 



fV a e G J 



(V a e Gj [3 a- 1 e GJ 
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The set G is said to be a semigroup if it satisfies only the first two condi- 
tions, i.e., it possesses an associative product operation, but does not have 
an identity element. 

A group is called Abelian (or commutative) if the product operation 
commutes: , , r 



For instance, the set of bijective transformations of a nonempty set S, 
with the product operation as the composite map, and the identity as the 
identity map, form the so-called symmetric group of the set S, Sym S. In 
particular, if S = {1, 2, . . ., n}, then Sym S — S n , the symmetric group of 
n letters; the elements of S n are the permutations of {1, 2, . . ., n}. If, on 
the other hand, we had considered the set of all transformations (not just 
the bijective ones) of a nonempty set S, the resulting structure would have 
been a semigroup with identity element. (A transformation is invertible if 
and only if it is bijective). 

Other examples of groups are the following: 

1. (Z, +, 0), the group of integers under addition; the (additive) inverse 
of an integer a is —a. 

2. (Q*, •, 1), the group of nonzero rational numbers under multiplica- 
tion; the (multiplicative) inverse of a rational p/q is q/p. 

3. The set of rotations about the origin in the Euclidean plane under 
the operation of composition of rotations. The rotation through an 
angle 9 is represented by the map (x, y) i— > (x' , y'), where 



The following are some of the examples of semigroups: 

1. (N, +, 0), the semigroup of natural numbers under addition. This 
semigroup has zero (0) as its additive identity. 

2. (Z, •, 1), the semigroup of integers under multiplication. This semi- 
group has one (1) as its multiplicative identity. 

Definition 2.1.2 (Subgroup) A subgroup G' of a group G is a nonempty 
subset of G with the product operation inherited from G, which satisfies the 
four group postulates of Definition 2.1.1. Thus, the (left) identity element 
e E G also belongs to G' , and the following properties hold for G': 




x' = x cos 9 — y sin 6, y' = x sin 9 + y cos 9. 




and 
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In fact, a subgroup can be characterized much more succinctly: a nonempty 
subset G' of a group G is a subgroup, if and only if 



If H C G is a subset of a group G, then the smallest subgroup (with 
respect to inclusion) of G containing H is said to be the group generated 
by H; this subgroup consists of all the finite products of the elements of H 
and their inverses. 

If Hi and H 2 are two arbitrary subsets of a group G, then we may 
define the product of the subsets, HiH 2 , to be the subset of G, obtained 
by the pointwise product of the elements of Hi with the elements of H 2 . 
That is, 



If Hi = {hi} is a singleton set, then we write hiH 2 (respectively, H 2 hi) to 
denote the subset HiH 2 (respectively, H 2 Hi). 

We may observe that, if Gi is a subgroup of G, then the product 
GiGi = Gi is also a subgroup of G. In general, however, the product 
of two subgroups Gi and G 2 of a group G is not a subgroup of G, except 
only when the subgroups Gi and G 2 commute: 



Definition 2.1.3 (Coset) If G' is a subgroup of a group G, and a, an 
element of G, then the subset aG' is called a left coset, and the subset G'a 
a right coset of G' in G. If a e G' , then aG' = G'a = G' . 

As each element a e G belongs to exactly one (left or right) coset of 
G' (namely, aG' or G'a), the family of (left or right) cosets constitutes a 
partition of the group G. □ 

All the cosets of a subgroup G' have the same cardinality as G', as can 
be seen from the one-to-one mapping G' — > aG' , taking g e G' to ag e aG' . 

Definition 2.1.4 (Normal Subgroup) A subgroup G' of a group G is 
called a normal (or self-conjugate) subgroup of G if G' commutes with every 
element a e G. That is, 




□ 



i2ii?2 = ^ /ii/i2 - hie Hi and /i 2 G #2 



GiG 2 — G 2 Gi. 




□ 



Definition 2.1.5 (Quotient Group) If G' is a normal subgroup of G, 
then the set 

G=\aG' : a e g} 



2G 
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consisting of the cosets of G' forms a group (under the product operation 
on subsets of G) . The coset G' is an identity element of the group G, since 



(v aG' G G) 



Furthermore, 



(V aG',bG' G G) 



G' • aG' = aG' • G' = aG' 



aG' • bG' = abG'G' = abG' G G 



(v aG', 6G', cG' G g) [(aG' • &G') • cG' = afecG' = aG' • (6G' • cG') 
and every element aG' has a left inverse (aG') _1 = a _1 G', since 



(v aG' G G) 



a^G' • aG' = a^aG' = G' 



The group of cosets of a normal subgroup G' (i.e., G, in the preceding 
discussion) is called a quotient group of G, with respect to G', and is 
denoted by G/G'. □ 

If the group is Abelian, then every subgroup is a normal subgroup. Let 
G be an Abelian group under a commutative addition operation (+) and 
G' a subgroup of G. In this case, the quotient group G/G' consists of the 
cosets a + G', which are also called the residue classes of G modulo G'. 
Two group elements a and b G G are said to be congruent modulo G', and 
denoted 

a = b mod(G'), 

if a + G' = 6 + G', i.e., a - b E G' . 

For example, the multiples of a positive integer m form a subgroup of 
(Z, +, 0), and we write 

a = b mod (to), 

if the difference a — b is divisible by to. The residue classes, in this case, 
are cosets of the form i + toZ = {i + km : k G Z}, (0 < i < to), and are 
called residue classes of Z mod m. 



2.1.1 Rings and Ideals 

Definition 2.1.6 (Ring) A ring R is a set with two binary operations 
(addition, +, and multiplication, •) such that we have the following: 

1. R is an Abelian group with respect to addition. That is, R has a zero 
element 0, and every x G R has an additive inverse —x. 

x + (-x) = . 



( V x e Rj [3 - x g i?J 
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2. R is a semigroup with respect to multiplication. Furthermore, mul- 
tiplication is distributive over addition: 



and [{y + z) ■ x = y ■ x + z ■ x] 



V x, y, z e Rj 
[x ■ (y + z) = x ■ y + x ■ z] 



We say R has an identity element if there is a 1 € R such that 



(v x e R) 



xl = lx 



The ring R is commutative if the multiplicative semigroup (i?, •) is com- 
mutative: 



(Vx,y e fl) 



.xy = yx 



□ 



The group (i?, +, 0) is known as the additive group of the ring _R. 

Some examples of rings are the following: the integers, Z, the rational 
numbers, Q, the real numbers, R, the complex numbers, C, polynomial 
functions in n variables over an ambient ring i?, R[x\, . . ., x n ] and rational 
functions in n variables over an ambient ring R, R(x\, . . ., x n ). The set of 
even numbers forms a ring without identity. 

An interesting example of a finite ring, Z> , can be constructed by con- 
sidering the residue classes of Z mod m. The residue class containing i 
is 

[i] m = i + mZ = -p + >T : 1 e Z}. 

We can define addition and multiplication operations on the elements of 
Z> as follows: 

[i]m + \j]m = [i + j] m and [i] m ■ [j] m = [ij} m . 

It can be easily verified that Z>, as constructed above, is a commutative 
ring with zero element [0] m and identity element [l] m ; it is called the ring 
of residue classes mod m. TL^ is a finite ring with m elements: [0] m , [l] m , 
. . ., [m — l] m . For the sake of convenience, Z> is often represented by the 
reduced system of residues mod m, i.e., the set {0, 1, . . ., m — 1}. 



In what follows we assume that all of our rings are com- 
mutative and include an identity element. Any violation 
of this assumption will be stated explicitly. 



A subring R' of a ring R is a nonempty subset of R with the addition 
and multiplication operations inherited from R, which satisfies the ring 
postulates of Definition 2.1.6. 

Definition 2.1.7 (Ideal) A subset I C R is an ideal if it satisfies the 
following two conditions: 
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1. I is an additive subgroup of the additive group of R: 

a-b e I ■ 

2. RI C J; I is closed under multiplication with ring elements: 

ab e I ■ 

The ideals {0} and R are called the improper ideals of R; all other ideals 
are proper. □ 

A subset J of an ideal I in R is a subideal of 7 if J itself is an ideal in 
i?. We make the following observations: 

1. If 7 is an ideal of i?, then 7 is also a subring of i?. 

2. The converse of (1) is not true; that is, not all subrings of R are ideals. 
For example, the subring Z C Q is not an ideal of the rationals. (The 
set of integers is not closed under multiplication by a rational.) 

Let a e R. Then the principal ideal generated by a, denoted (a), is 
given by 

(a) ={ra:re R}, if 1 G i?. 

The principal ideal generated by zero element is (0) = {0}, and the prin- 
cipal ideal generated by identity element is (1) = R. Thus, the improper 
ideals of the ring R are (0) and (1). 

Let ai, . . ., ak € R. Then the ideal generated by ai, . . ., ak is 

k 

(ai, ...,a k ) = jyVjaj : n e iij. 

»=1 

A subset F C I that generates I is called a basis (or, a system of generators) 
of the ideal I. 

Definition 2.1.8 (Noetherian Ring) A ring R is called Noetherian if 
any ideal of R has a finite system of generators. □ 

Definition 2.1.9 

An element x € R is called a zero divisor if there exists y ^ in i? such 
that = 0. 

An element x € i? is nilpotent if x™ = for some n > 0. A nilpotent 
element is a zero divisor, but not the converse. 

An element x <G R is a wm£ if there exists y & R such that xy = 1. The 
element y is uniquely determined by x and is written as x _1 . The units of 
i? form a multiplicative Abelian group. □ 



(V a, b e J 



f V a e flj (v 6 e /J 



Section 2.1 



Introduction to Rings and Ideals 



29 



Definition 2.1.10 

A ring R is called an integral domain if it has no nonzero zero divisor. 
A ring R is called reduced if it has no nonzero nilpotent element. 
A ring R is called a field if every nonzero element is a unit. IZI 

In an integral domain R, R \ {0} is closed under multiplication, and is 
denoted by R*; (R* , •) is itself a semigroup with respect to multiplication. 
In a field K, the group of nonzero elements, (K* , •, 1) is known as the 
multiplicative group of the field. 

Some examples of fields are the following: the field of rational numbers, 
Q, the field of real numbers, R, and the field of complex numbers, C. If p 
is a prime number, then Z, (the ring of residue classes mod p) is a finite 
field. If [s] p G Z* , then the set of elements 

[s] p , [2s] p , [(p- l)s] p 

are all nonzero and distinct, and thus, for some s' G [l..p— 1], [s's] p = [l] p ; 
hence, ([s] p ) _1 = [s'] p . 

A subfield of a field is a subring which itself is a field. If K' is a subfield 
of K , then we also say K is an extension field of K 1 . Let a G if; then the 
smallest subfield (under inclusion) of K containing K' U {a} is called the 
extension of K' obtained by adjoining a to if', and denoted by K'(a). 

The set of rationals, Q, is a subfield of the field of real numbers, K. If 
we adjoin an algebraic number, such as y/2, to the field of rationals, Q, 
then we get an extension field, Q(v^) Q R- 

Definition 2.1.11 A field is said to be a prime field, if it does not contain 
any proper subfield. It can be shown that every field K contains a unique 
prime field, which is isomorphic to either Q or Z,, for some prime number 
p. We say the following: 

1. A field K is of characteristic (denoted characteristic K = 0) if its 
prime field is isomorphic to Q. 

2. A field K is of characteristic p > (denoted characteristic K = p), 
if its prime field is isomorphic to Z, . □ 

Proposition 2.1.1 R ^ {0} is a field if and only if 1 G R and there are 
no proper ideals in R. 

PROOF. 

(=>) Let R be a field, and I C R be an ideal of R. Assume that I ^ (0). 
Hence there exists a nonzero element a £ I. Therefore, 1 = aa^ 1 G /, i.e., 
I=(1)=R. 

(<=) Let a G R be an arbitrary element of R. If a ^ 0, then the principal 
ideal (a) generated by a must be distinct from the improper ideal (0). Since 
R has no proper ideal, (a) = R. Hence there exists aniefi such that 
xa = 1, and a has an inverse in R. Thus R is a field. □ 
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Corollary 2.1.2 Every field K is a Noetherian ring. 

PROOF. 

The ideals of K are simply (0) and (1), each of them generated by a single 
element. □ 

Let R ^ {0} be a commutative ring with identity, 1 and S C R, a 
multiplicatively closed subset containing 1 (i.e., if s± and s 2 € S, then 
si ■ s 2 € S.) Let us consider the following equivalence relation "<~" on 
RxS: 

(v (n,si),(r 2 ,s 2 ) efixs) 

(ri,si) ~ (r 2 ,s 2 ) iff (3 s 3 e S) [ssC^n. - r 2 si) = 0] . 

Let Rs = fix S/ <~ be the set of equivalence classes on Rx S with respect 
to the equivalence relation <~. The equivalence class containing (r, s) is 
denoted by r/s. The addition and multiplication on Rs arc defined as 
follows: 

n r 2 s 2 ri + r 2 S! rx r 2 r x r 2 
1 = and — • — = . 

s\ s 2 sis 2 Si s 2 sis 2 

The element 0/1 is the zero element of Rs and 1/1 is the identity element 
of Rs- It is easy to verify that Rs is a commutative ring. The ring Rs is 
called the ring of fractions or quotient ring of R with denominator set S. 

If S is chosen to be the multiplicatively closed set of all non-zero divisors 
of R, then Rs is said to be the full ring of fractions or quotient ring of 
R, and is denoted by Q(R). In this case, the equivalence relation can be 
simplified as follows: 

(V (r ljSl ),(r 2 , S2 >efixS) 

~ (r 2 ,s 2 ) iff s 2 r 1 =r 2 s 1 . 

If D is an integral domain and S = D* , then Ds can be shown to be a 
field; Ds is said to be the field of fractions or quotient field of D, and is 
denoted by QF(D). The map 

i : QF(D) 
: d^d/l 

defines an embedding of the integral domain D in the field QF(D); the 
elements of the form j are the "improper fractions" in the field QF(D). 

For example, if we choose D to be the integers Z, then QF(Z) is Q, the 
field of rational numbers. 
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2.1.2 Homomorphism, Contraction and Extension 

Definition 2.1.12 (Ring Homomorphism) The map 0: R — > R' is called 
a ring homomorphism, if (f)(1) = 1 and 

(v a, 6 G i?) 0(a + 6) = 0(a) + 0(6) and 0(a 6) = 0(a) 0(6) . 

That is, respects identity, addition and multiplication. □ 

If 0: ii — > i?' and i?' — > R" are ring homomorphisms, then so is their 
composition ip o (j>. 

The kernel of a homomorphism 0: i? — > i?' is defined as: 

kcr = | a e i? : 0(a) = o|. 

The image of a homomorphism 0: i? — ^ R' is defined as: 
im0 ={a'ei?' : (3 a e R^j 0(a) = a'J|. 



Let / be an ideal of a ring R. The quotient group R/I inherits a 
uniquely defined multiplication from R which makes it into a ring, called 
the quotient ring (or residue class ring) R/I. The elements of R/I are the 
cosets of I in R, and the mapping 

: R°^°R/I 
: x i ► .x ~\~ I 

which maps iGiito its coset x + I is a surjective ring homomorphism. 
Thus the multiplication operation on R/I is as follows 

(x + I)(y + I)=xy + I. 

This definition is consistent, since, if y + I = y' + I, then y — y' £ I, i.e., 
x(y — y') = xy — xy' G I and xy + I = xy' + I. 

Proposition 2.1.3 

1. For every ring homomorphism, <ft, ker0 is an ideal. 

2. Conversely, for every ideal I C R, I = kcr for some ring homomor- 
phism 0. 

3. For every ring homomorphism, <p, im0 is a subring of R' . □ 
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Consider the ring homomorphism 



tp : Rj kcr (j) —? im0 
: x + ker0 i— » 4>{x). 

ip is a ring isomorphism, since if ip(x + ker0) = ip(y + ker0) (i.e., (p(x) = 
<p{yj), then <p(x — y) = 0(x) — (j){y) = and x — y G ker0, thus, implying 
that a; + ker <p = y + ker 0. Hence 0: R — > i?' induces a ring isomorphism: 

J?/ kcr = im (p. 

Proposition 2.1.4 Let 4>:R°—?R' be a ring homomorphism of R onto R' . 

1. If I C R is an ideal of R, then 

<p(I) = \a' G R' : (b a G /) ^(a) = a'] } 
is on ideal of R' . Similarly, if I' C i?' is an idea/ o/i?', iftera 

_1 (J') = |a G i? : (3 a' E /') 0(a) = a' } 
is an idea/ 0/ i?. 

2. There is a one-to-one inclusions preserving correspondence between 
the ideals I' of R' and the ideals I of R which contain ker (p, such 
that if I and I' correspond, then 

0(J) = J', 4>- l {i') = i- 

When I and I' correspond, <p induces a homomorphism of I onto 
and 

//ker0~/', R/I^R'/I'. □ 

Definition 2.1.13 (Contraction and Extension) Let <p: R — > R' be a 

ring homomorphism. 

1. If I' C R' is an ideal of R' then the ideal 

/' c = (p-\l') = {a G R : (3 a' G /') [0(a) = a'] } 

in R is called the contracted ideal (or, simply, contraction) of /'. If 
the underlying homomorphism can be inferred from the context, 
then we also use the notation I'{R} for the contracted ideal. In 
particular, if R is a subring of R! , then the ideal I'{R} = Rdl', and 
it is the largest ideal in R contained in /'. 
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2. If I C R is an ideal of R, then the ideal 

I e = R'4>{I) = (\a' e R' : (3 a e /) 4>(a) =a' }^ , 

i.e., the ideal generated by 4>{I) in i?' is called the extended ideal 1 
(or, simply, extension) of 7. If the underlying homomorphism <J) can 
be inferred from the context, then we also use the notation I{R'} for 
the extended ideal. In particular, if R is a subring of R' , then the 
ideal I{R'} = R'l, and R'l is the smallest ideal in R' which contains 
/. □ 

The following relations are satisfied by the contracted and extended 
ideals: 

1. I' <ZJ' => I /c C J' c , and 7 C J 7 e C J e . 

2. 7' ce C 7', and 7 ec D 7. 

3. 7' cec = 7' c , and I ece = r. 

The last relation says that if an ideal in R' is an extended ideal, then it is 
the extension of its contraction, and that if an ideal in R is a contracted 
ideal, then it is the contraction of its extension. 

Let C be the set of contracted ideals in R, and let £ be the set of 
extended ideals in R' . We see that the mapping I' I' c and I I e 
are one-to-one and are inverse mappings of C onto £ and of £ onto C, 
respectively. 

2.1.3 Ideal Operations 

Let I, J C R be ideals. Then the following ideal operations can be defined: 

1. Sum: 1 + J = ja + 6 : a e J and 6 e jj. 

It is the smallest ideal containing both / and J. 

2. Intersection: J (~l J = ja : oeJ and a € jj. 

It is the largest ideal contained in both / and J. 

3. Product: IJ — j^ILi : a « G /, 6i G J and n e n|. 

We define the powers I n (n > 0) of an ideal J as follows: conven- 
tionally, 7° = (1), and I n = I J"" 1 . Thus 7" (n > 0) is the ideal 
generated by all products xi Xi ■ ■ ■ x n in which each factor Xj belongs 
to 7. 



1 Note that <j>(I) itself is not an ideal in R' and hence, one needs to extend it sufficiently 
to obtain the smallest ideal containing <j>{I). Also, the notation R'<fi(I) does not stand 
for the elementwise product of the sets R' and <fi(I) as such a set is not necessarily an 
ideal. R 1 </>(!) may be interpreted as the ideal product, which will be defined shortly. 
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4. Quotient: J : J = |a e R : a J C 7j. 

The quotient (0) : J is called the annihilator of J (denoted ann J): 
it is the set of all a £ R such that a J = 0. 



The following are some interesting properties of the ideal operations: 

1. The operations sum, intersection and product are all commutative 
and associative. 

2. Modular Law: If I D J, then 7 n ( J + K) = J + (7 n K). This can 
be also written as follows: 



[I D J or I D K] [I n (J + K) = (I n J) + (I n K)}. 

3. I(J + K) = IJ + IK. 

Hence, (I + J)(I n J) = 1(1 n J) + J(J n J) C 7 J. 

4. ucinJ. 

Two ideals I and J are called coprime (or comaximal), if 7 + J = (1). 
Hence, we have IJ — IdJ provided that 7 and J are coprime. [Note 
that, in this case, 7 n J = (7 + J) (7 n J) C 7 J.] 

5. (a) 7 C 7 : J. 

(b) (7 : J) J C 7. 

(c) ((I:J):K)=(I:JK)=((I:K)-.j). 

(d) (fVi = ./) = fWi = A 

(e) (/ : T,i J i) = Hii 1 : J i)- 

6. (a) V7D 7. 

(b) VVT = V7. 

(c) V77= VTrT7 = V7nV7. 

(d) V7=(i) iff 7 = (1). 

(e) VTT7 = V7TT7J. 



5. Radical: V7 = \ a e R : [3 
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2.2 Polynomial Rings 

Let S be a ring, and a; be a new symbol (called a variable, or indeterminate) 
not belonging to S. An expression of the form 

f(x) = ^ aiX 1 , where a, G 5, 

i 

in which the sum is taken over a finite number of distinct integers i > 0, 
is called a univariate polynomial over the ring S 1 . The ring elements dj's 
are called the coefficients of /. [It is implicitly assumed that <Zj = 0, if 
dj is missing in the expression for f{x).\ All powers of x are assumed to 
commute with the ring elements: aix % = x l ai. 

The operations addition and multiplication of two polynomials f(x) = 
and g{x) = 5^ • fya^ 7 are defined as follows: 

/(a;) + g(x) = ^ CfcX fe , where c k = a k + b k , 
k 

f(x)-g(x) = ^^c k x k , where c k = ^ aj6j. 

k i-^j—k 

It can be easily verified that the collection of polynomials with these addi- 
tion and multiplication rules form a commutative ring with zero element 
and identity element 1. The polynomial ring, thus obtained by adjoining 
the symbol x to S, is denoted by R = S[x]. 

The degree of a nonzero polynomial f(x), (denoted deg(/)), is the high- 
est power of x appearing in /; by convention, deg(0) = — oo. 

Let x\, . . ., x n be n distinct new symbols not belonging to S. Then the 
ring R obtained by adjoining the variables x\, . . ., x n , successively, to S is 
the ring of multivariate polynomials in x\, . . ., x n over the ring S: 

R = S[xi] ■ ■ ■ [x n ] = S[x 1 , . . .,x n ]. 

Thus R consists of the multivariate polynomials of the form: 

^ ' a e 1 ,...,e rl Xi • • • X„" . 

A power product (or, a term) is an element of R of the form 
p = x^ e t > 0. 

The total degree of the power product p is 

n 

deg(p) = J^e 4 . 

i=l 
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The degree of p in any variable X{ is deg x .(p) = e^. By the expression 
PP(xi, . . ., x n ), we denote the set of all power products involving the 
variables x\, . . ., x n . 

A power product p — xf 1 ■ ■ ■ x d ^ is a multiple of a power product q = 
xl 1 ■ ■ ■ x^™ (denoted q \ p), if 

ei<di . 

Synonymously, we sayp is divisible by q. The least common multiple (LCM) 
of two power products p = xf 1 ■ ■ ■ xf^ and q — x^ 1 ■ ■ ■ x e ^ is given by 

max(di,ei) max(d„.e„) 
x 1 ■ ■ ■ x n 

The greatest common divisor (GCD) of two power products p — x^ 1 ■ ■ ■ x^ n 
and q — x^ 1 ■ ■ ■ x e ™ is given by 

min(di,ei) min(d„,e„) 
x 1 x n 



N 1 < i < nj 



A monomial is an element of R of the form m — ap where a € S is its 
coefficient and p G PP(xi, . . ., x n ) is its power product. The total degree 
of a monomial is simply the total degree of its power product. 

Thus, a polynomial is simply a sum of a finite set of monomials. The 
length of a polynomial is the number of nonzero monomials in it. The total 
degree of a polynomial / [denoted deg(/)] is the maximum of the total 
degrees of the monomials in it; again, by convention, deg(0) = — oo. Two 
polynomials are equal, if they contain exactly the same set of monomials 
(not including the monomials with zero coefficients). 

2.2.1 Dickson's Lemma 

The following lemma about the power products due to Dickson has many 
applications: 

Lemma 2.2.1 (Dickson's Lemma) Every set X C PP(xi, . . x n ) of 
power products contains a finite subset Y C X such that each p G X is a 
multiple of some power product in Y. 
PROOF. 

We use induction on the number n of variables. If n = 1 then we let Y 
consist of the unique power product in X of minimum degree. So we may 
assume n > 1. Pick any p € X, say 

p Q =x e 1 1 ■■■ x e n n . 

Then every p e X that is not divisible by p belongs to at least one of 
J27=i e i different sets: Let i = 1, . . ., n and j = 0, 1, . . ., — 1; then the 
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set Xij consists of those power products p's in X for which deg x .(p) = j. 
Let X[ - denote the set of power products obtained by omitting the factor 

x J i from power products in X^j. By the inductive hypothesis, there exist 
finite subsets Y[- C X\ - such that each power product in X-j is a multiple 
of some power product in Y/j . We define Fjj as 

{/'••'•; : /'• >,',!• 

It is then clear that every power product in X is a multiple of some power 
product in the finite set 

Y= ({po} U U V < ..) <^ X - n 

The proof of Dickson's lemma (Lemma 2.2.1) can be understood picto- 
rially as follows: 

Consider the case when n = 2; then every power product be 
associated with a representative •point with coordinates (ei, e?) in N . Note 

that every power product x^Xj 2 with e[ > ei and e' 2 > e2 is a multiple 
of x^x^ 2 ; these are the power products whose representative points are 
above and to the right of the point (ei, e<i): in Figure 2.1, the shaded 
region represents all such points. 

Thus, given a set X C PP(xi,x 2 ), we consider their representative 
points in N K . We first choose a power product x^x^ 2 G X. As all the 
points of X in the shaded region are now "covered" by x^x^ 2 , we only 
need to choose enough points to cover the remaining points of X, which 
belong to the region ([0..ei-l] xN)U(Nx [K.^-J^]). For every i, < i < ei, 
if 

i! = min fc, 

then the power product x^x| covers all the points of X in i x N. Similarly, 
for every j, < j < e2, if 

j' = min k, 

then the power product x{ x J 2 covers all the points of X in N x X Thus the 
finite set 

{x^x^ 2 } U {x\x{ : < i < ei} U {x(x> 2 : < j < e 2 } 

is the desired set Y C X. 

Let R — S[xi, . . ., x n ] be a polynomial ring over an ambient ring S. Let 
G C i? be a (possibly, infinite) set of monomials in i?. An ideal / = (G), 
generated by the elements of G is said to be a monomial ideal. Note that 
if J ^ 7 is a subideal of 7, then there exists a monomial m £ I\J. 
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Theorem 2.2.2 Let K be a field, and I C K[x\, . . ., x n ] be a monomial 
ideal. Then I is finitely generated. 

PROOF. 

Let G be a (possibly, infinite) set of monomial generators of /. Let 

l = {pG PP(xi, . . . , x n ) : ap e G, for some a G K}. 

Note that (X) = (G) = /. 

m = ap e G => m£(X), and 

pe X => (3m = ape G^j p = ar x m G (G) . 

Now, by Dickson's lemma, X contains a finite subset FCI such that 
each p e X is a multiple of a power product in Y . 
Since FCX, clearly (F) C (X). Conversely, 



P&X => [3qeY 



q\p =>p€(Y). 



Thus (Y) = (X) = I, and Y is a finite basis of I. □ 
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2.2.2 Admissible Orderings on Power Products 
Definition 2.2.1 (Admissible Ordering) A total ordering < on the 

A 

set of power products PP(a;i, . . ., x n ) is said to be admissible if for all 
power products p, p' , and q <G PP(xi, . . ., x n ), 

1. 1 < p, and 

A 

2. p < p' => pq < p'q. 

A A 

Any total ordering that satisfies the second condition, but not necessarily 
the first, is called a semiadmissible ordering. 

Note that the only semiadmissible orderings on PP(a;) are 

l<x<x 2 < ••• <x n < ••• and 

• • • <x n < ■■ ■ <x 2 <x< 1, 

b b b b b 

of which only the first one is admissible. □ 

We also write p < q if p ^ q and p<q. Note that if a power product q is 

A A 

a multiple of a power product p, then p < q, under any admissible ordering 

A 

< : 

A 

p'p = q ; 
but 1 <p', and thus, p<p'p = q. 

A A 

Lemma 2.2.3 Every admissible ordering < on PP is a well- ordering. 

A 

PROOF. 

This follows from Dickson's lemma: Suppose we have an infinite descending 
sequence of power products 

Pi >Pi > ■■■ >Pi > ■■ ■ 

A A A A 

Let X = {pi, p2 1 ■ ■ ., Pi, ■ ■ ■} and let Y C X be a finite subset such that 
every p <G X is a multiple of some power product in Y . Let p' be the power 
product that is smallest in Y under the ordering < : 

A 

p = min Y. 

< 

A 

Since the power products in X constitute an infinite descending sequence, 

q<p' . 



p | q => y3 a power product, p'\ 



(3 q e X) 
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But, we know that 

(a P e y) [ p I q • 

Hence, 

p e P<q<P 
contradicting the choice of p' . □ 

Proposition 2.2.4 Let < and < be two semiadmissible orderings on 

X Y 

PP(X) and PP{Y), respectively. 

1. Define < on PP(X,Y) as follows: 

pq<p'q', where p,p e PP(X) and q, q' € PP(Y), 
if (i) p<p' , or (ii) p = p' and q<q' '. 

X ' Y 

2. L>e/?ne < on PP(X,Y) as follows: 

R 

pq<p'q\ where p,p' <E PP(X) anrf q, G PP(F), 

R 

z/ ('zj (7 < q' 7 or ^ii^ q = q' and p<p' . 

Y x 

TTien 6of/i < and < are semiadmissible. Furthermore, if both < and < 

L R X Y 

are admissible, then so are both < and < . □ 

L R 

Let p = x^x^ 2 ■ ■ ■ x^ n and q = x^a;^ 2 ■ ■ ■ x h ™ be two power products in 
PP(xi,X2, . . . , x n ). We define two semiadmissible orderings, lexicographic 
and reverse lexicographic as follows; their semiadmissibility follows from 
the above proposition. 

1. Lexicographic Ordering: ( > ) 

LEX 

We say p > q if aj 7^ 6j for some i, and for the minimum such i, 

LEX 

fli > bi, i.e., the first nonzero entry in 

(01, ...,a n )- (61, . . . ,6„) 

is positive. This is easily seen to be also an admissible ordering. 
Note that, x\ > x 2 > ■■■ > x n . For example, in PP(w, x, y, z), 

LEX LEX LEX 

we have: 

1 < 2 < z 2 < ■■■ 

LEX LEX LEX 

< y < yz < ••• < y 2 --- 

LEX LEX LEX LEX 

< x < xz < ■ ■ ■ < xy < ■ ■ ■ < x 2 ■ ■ ■ 

LEX LEX LEX LEX LEX LEX 

< w < wz < ■ ■ ■ < wy < • • • < wx < ■ ■ ■ < w 2 ■ ■ ■ 

LEX LEX LEX LEX LEX LEX LEX LEX 
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2. Reverse Lexicographic Ordering: ( > ) 

RLEX 

We say p > q if a; ^ hi for some i, and for the maximum such i, 

RLEX 

Oj < bj, i.e., the last nonzero entry in 

(oi, . . . ,a„) - (&!, . . . ,b n ) 

is negative. This ordering is scmiadmissible, but not admissible. 
Note that a; i > x 2 > ■■■ > x n . For example, in PP(w, x, y, z), 

RLEX RLEX RLEX 

we have: 

< z 2 < • ■ • < yz < ■ • ■ < xz < ■ • ■ < wz < z < 

RLEX RLEX RLEX RLEX RLEX RLEX RLEX RLEX RLEX 

< y 2 < ■ ■ ■ < xy < ■ ■ ■ < wy < y < 

RLEX RLEX RLEX RLEX RLEX RLEX RLEX 

< x 2 < ■ ■ ■ < wx < X < 

RLEX RLEX RLEX RLEX RLEX 

< w 2 < w < 1. 

RLEX RLEX RLEX 

Let p = x^x^ 2 ■ ■ ■ x a ™ and q — x^x^ • • • x b ™ be two power products in 
PP(xi,x 2 , ■ ■ ■ ,x n ). We say p y q, if deg(p) > deg(q). The order >- is 

TOT TOT 

only a partial ordering, and hence not an admissible ordering. However, 
we can make it an admissible ordering by refining it via a semiadmissiblc 
ordering. 

Let > be a semiadmissible ordering. We define a new ordering > 

A TA 

(the total ordering refined via > ) on PP(a;i, . . ., x n ) as follows: We say 

A 

p > q, if deg(p) > deg(g), or if, when deg(p) = deg(g), p>q. That is, the 

TA ' ' ' A 

power products of different degrees are ordered by the degree, and within 
the same degree the power products are ordered by > . 

A 

Lemma 2.2.5 The ordering > is an admissible ordering. 

TA 

PROOF. 

Since > is a refinement of the total degree ordering, for all p £ PP, 1 < p. 

TA * TA 

Now assume that p < p' . Then if deg(p) < deg(p'), deg(pq) < dcg(p'q) and 

TA 

PQ < P'l- Otherwise deg(p) = deg(p') and p<p' ■ Hence deg(pq) = deg(p'q) 

TA A 

and by the semiadmissibility of > , pq <p'q, and pq < p'q. □ 

A A TA 

The next two admissible orderings have important applications in com- 
putations involving homogeneous ideals 2 ; by the proposition above, both 
of them are admissible orderings. 



2 Roughly, a homogeneous polynomial is one in which every monomial is of the same 
degree, and a homogeneous ideal is one with a basis consisting of a set of homogeneous 
polynomials. 
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1. Total Lexicographic Ordering: ( > ) 

TLEX 

We say p > q if 

TLEX 

(a) dcg(p) > deg(q), or else, 

(b) In case deg(p) = deg(q), di ^ bi for some i, and for the minimum 
such i, di > bi, i.e., the first nonzero entry in 

(ax, . . .,a n ) - ...,b n ) 

is positive. 
For example, in PP(io, x, y, z), we have 

l<z<y<x<w<z 2 <yz<y 2 <xz<xy 

TLEX TLEX TLEX TLEX TLEX TLEX TLEX TLEX TLEX 

< x 2 < wz < wy < wx < w 2 ■ ■ ■ 

TLEX TLEX TLEX TLEX TLEX 

2. Total Reverse Lexicographic Ordering: ( > ) 

TRLEX 

We say p > q if 

TRLEX 

(a) deg(» > deg(g), or else, 

(b) In case deg(p) = deg(q), ^ bi for some i, and for the maximum 
such i, ai < bi, i.e., the last nonzero entry in 

(ai, . . . ,a n ) - (6i, ...,b n ) 

is negative. 
For example, in PP(w, x, y, z), we have 

l<z<y<x<w<z 2 <yz<xz< wz 

TRLEX TRLEX TRLEX TRLEX TRLEX TRLEX TRLEX TRLEX 

< y 2 < xy < wy < x 2 < wx < w 2 ■ ■ ■ 

TRLEX TRLEX TRLEX TRLEX TRLEX TRLEX 

Henceforth, fix < to be any admissible ordering. By an abuse of nota- 

A 

tion, we will say, for any two monomials m = ap and m' = a'p 1 (a, a 1 e S 
and p, p' € PP), m<m' , if p<p'. We also assume that every polynomial 

A A 

/ e R is written with its monomials ordered as a descending sequence un- 
der > , i.e., f = mi + m,2 H + mk, where each of nij's is a monomial of 

A 

/, and mi > m 2 > ••• >m k . 
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Definition 2.2.2 (Head Monomial) Let / € R be a polynomial. The 
head monomial Hmono(/) of / is the monomial in / whose power product 
is largest relative to < , i.e., if / = mi + rri2 + • • • + mk, then Hmono(/) = 

A 

mi. 

We define the head term of / (denoted, Hterm(/)) to be the power 
product of Hmono(/) and the head coefficient of / (denoted, Hcoef(/)) to 
be the coefficient of Hmono(/). Thus 

Hmono(/) = Hcocf(J) • Hterm(/). 

By convention, Hcocf(O) = Hmono(O) = 0. We use the notation Tail(/) to 
stand for / — Hmono(/). □ 

For instance, relative to any admissible total degree ordering, the head 
monomial of / = Axy + y — 5 is Hmono(/) = 4xy, when / is considered to 
be a polynomial inZ[n,rv]. Notice that if we consider / to be an element 
of (Z[^o])[rv], then, under any admissible ordering, the head monomial of 
/ is Axy + y. 

The lexicographic, total lexicographic and total reverse lexicographic 
(admissible) orderings play certain important roles in various computa- 
tions involving ideals, the last two of the above three admissible orderings 
being very crucial in the case of homogeneous ideals. The reasons for their 
importance are primarily the following: 

1. The lexicographic ordering has the property that for each subring 
S[xi, . . ., x n ] C R, and each polynomial / G R, / <E S[x i7 . . ., x n ] if 
and only if Hmono(/) <E S[x i7 . . ., x n ]. 

2. The total lexicographic ordering has the property that for each sub- 
ring S[xi, . . ., x n ] C R 1 and each homogeneous polynomial / e R, 
f € S[xi, . . ., x n ] if and only if Hmono(/) <G S[xt, . . ., x n ]. 

3. The total reverse lexicographic ordering has the property that for 
each homogeneous polynomial / € S[x\, . . ., Xi], Xi divides / if and 
only if Xi divides Hmono(/). 

As a result, there is an elimination algorithm (similar to Gaussian elim- 
ination for a system of linear equations) such that the elimination using 
the lexicographic ordering (total lexicographic ordering) produces elements 
of an ideal (homogeneous ideal) which are free of the first variable, and the 
elimination using the total reverse lexicographic ordering produces elements 
of a homogeneous ideal which are divisible by the last variable. 
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2.3 Grobner Bases 

Definition 2.3.1 (Head Monomial Ideal) Let G C R be a subset of 
i?. The /lead monomial ideal of G [denoted Head(G)] is the ideal generated 
by the head monomials of the elements of G, i.e., 

Head(G) = ({Hmono(/) : / e G}) . □ 

Definition 2.3.2 (Grobner Basis) A subset G of an ideal / C R in R 
is called a Grobner basis of the ideal I if 

Hcad(G) = Hcad(J), 

i.e., if the set of monomials {Hmono(f) : f G G} is a basis of Head(I). 
□ 

If, in the definitions of head monomial, head monomial ideal and Grobner 
basis, the underlying admissible ordering > is not readily decipherable, 

A 

then we explicitly state which ordering is involved by a suitable subscript. 

Notice that since G — I satisfies the above condition, every ideal has 
a Grobner basis. Also, an ideal can have many distinct Grobner bases. 
For instance, if G is a Grobner basis for /, then so is every G', GCG'C 
I. However, since a Grobner basis need not be finite, as such, it is not 
computationally very beneficial. Also, in general (for arbitrary ring S), we 
do not know how to compute a Grobner basis effectively. 

Further, notice that if G C I, then Head(G) C Head(I). Hence, 
to demonstrate that G is a Grobner basis of /, it suffices to show that 
Head(G) D Hcad(/). 

The following theorem justifies the term "basis" in a Grobner basis: 

Theorem 2.3.1 Let I C R be an ideal of R, and G a subset of I. Then 

Head(G) = Head(J) => (G) = /. 
That is, every Grobner basis of an ideal generates the ideal. 

PROOF. 

Since G C I, we have (G) C /. If (G) ^ I then we may choose an / € I\(G) 
such that Hmono(/) is minimal with respect to the underlying admissible 
well-ordering, say < , among all such polynomials. Thus, Hmono(/) G 

Head(J) = Head(G)t 

Hmono(/) = t i Hmono(g i ), ti G R, 

9.£G 
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and 

/' = Tail(/) - ]T ^Tail( 5i ) = / - £ t,.g, e /. 

Clearly, /' e I\(G), since, otherwise, / = f' + ^Ugi would be in (G). But, 
Hmono(f) <Hmono(/), since every monomial in Tail(/) as well as every 

A 

monomial in each of iiTail(gi) is smaller than Hmono(/); consequently, we 
have a contradiction in our choice of /. Hence, (G) = /, i.e., every Grobner 
basis of an ideal generates the ideal. □ 

Corollary 2.3.2 

1. Two ideals I and J with the same Grobner basis G are the same: 
I=(G) = J. 

2- If J C I are ideal of R, and Head(J) = Hcad(J), then J = I. 

PROOF. 

(1) is simply a restatement of the previous theorem. (2) follows from (1) 
since J is a Grobner basis for both / and J. □ 

Theorem 2.3.3 The subset G C / is a Grobner basis of an ideal I of R 
with respect to an admissible ordering > if and only if each polynomial 

A 

h G / can be expressed as 

h = ^2 ftQt, ft e R, 
such that Htcrm(/i) Htcrm(gi) < Hterm(fe). 

A 

PROOF. 

(=>) Let h E I. Inductively, assume that the theorem holds for all h' < h. 

A 

Since Hmono(fe) <G Head(I) = Head(G), it is possible to write 

Hmono(/i) = ^ OiPiHrnono^), a; E S, p { E PP(ii, . . . ,x n ) 

such that pi Hterm(gi) = Hterm(ft,). 
Let 

h' = Tail(fr) - ^ a iPi Tail(ffi) = h - ^ a l p l g i . 
Since Hmono(fe') < Hmono(fe), by the inductive hypothesis, we can write h 

A 

as 

h = h! + ^2 a *Pi9i = ^2 f'idi + ^2 a *Pi9i, 
such that Hterm(/ l ') Hterm(^) < Hterm(ft,') < Hterm(/i) and piHtcrm(gi) 

A A 

<Hterm(/i). 

A 
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(<=) Without loss of generality, we assume that h E / is expressed as 

h = a x p x g x -\ h a k p k g k , a; E S, p { E PP(ii, . . . ,x n ), g { E G, 

such that pi Hterm(gi) < Hterm(/i). 

A 

Let L be the set of indices such that 

L = ji e {1, . . . , k} : Hterm(/i) = pjHterm(<7j) j. 

Since Htcrm(/i) >pi Htcrm(gj), L ^ 0. 

A 

Equating terms of equal degree in the previous expression for h, we get 
Hcoef(/i) = X^iGL ffl»Hcoef(gj). Hence 

Hmono(/i) = Hcoef(/i) Hterm(/i) 

= ^ aiHcoef(gi ) Hterm(gi ) 

= ^a 4 p 4 Hmono(5 4 ), 

i.e., Head(G) D Head(I), and G is a Grobner basis of /. □ 

2.3.1 Grobner Bases in K[xi,x 2: ...,x n ] 

Let K be any arbitrary field, and K[x\,X2, ■ ■ ■ , x n ] be the polynomial ring 
over the field K in variables x\, x 2 , ■ ■ ., and x n . 

Theorem 2.3.4 Every ideal I of K[x\, x 2 , ■ ■ ., x n ] has a finite Grobner 
basis. 

PROOF. 

Let < be an arbitrary but fixed admissible ordering on PP(ii, . . ., x n ). 

A 

Let X = {Hterm(/) : / E 1} be a subset of PP(xi,a;2, . . . ,x n ). Then 
by Dickson's lemma, there is a finite subset Y C X such that every power 
product of X is divisible by a power product of Y. Define an injective map 
5>: Y — ► J, as follows: for each power product p E Y choose a polynomial 
g = E I such that Htcrm(g) = p. This map is well defined, since every 
p E Y is a head term of some polynomial in /; it is injective, since p, q E Y 
and p q implies that Htcrm(<I>(p)) ^ Hterm($(<j')), and < E > (p) ^ &(q)- 

Let G = <&(Y) C /. From the finiteness of Y, it trivially follows that G 
is finite. But, by proceeding as in the proof of Theorem 2.2.2, we see that 

Hcad(G) = (Y) = {X) = Head(7), 

and G is a Grobner basis for I. D 

Corollary 2.3.5 For any field K, 

1. Every ideal of K\x\, x 2 , • ■ • , x n ] has a finite system of generators. 

2. K[xi,X2, ■ • ■ , x n ] is a Noetherian ring. □ 
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2.3.2 Hilbert's Basis Theorem 

Proposition 2.3.6 Let R be a ring. Then the following three statements 
are equivalent: 

1. R is Noetherian. 

2. The ascending chain condition (ACC) for ideals holds: 
Any ascending chain of ideals of R 

h c h c • • • c i n c • • • 

becomes stationary. That is, there exists an no (1 < no) such that 
for all n>n Q , 7„ = I n . 

3. The maximal condition for ideals holds: 

Any nonempty set of ideals of R contains a maximal element ( with 
respect to inclusion). 

PROOF. 
(1^2): 

For a chain of ideals of R 

h c h c • • • C I n c • • • 

7 = U^Li In i s a l so an ideal of R. (f,g£l implies that for large enough 
n Qi f i 9 € Aioi hence / — g £ I m C /. f £ I implies that for large enough 
n Qi f € In 1 hence for all h e R, h ■ f e I m C 7.) 

By hypothesis, 7 is finitely generated: I = (/i, / 2 , . . . , f m ), G i?. For 
sufficiently large n we have /j £ I no (i = 1, . . . , m). Thus 

7 = (/i,/ 2 , . • • , / m ) C 7„ C 7„ 0+1 C • • • C 7, 

and for all n > n , 7„ = 7„ = 7. 
(2^1): 

Assume to the contrary. Then there is an ideal 7 of 7?, which is not finitely 
generated. If /i, / 2 , . . ., f m £ I, then (/i, / 2 , • • • , f m ) £ 7. Hence there is 
an / m+ i £ I, f m+1 (/i,/ 2 , . . . , / m ). Thus 

(/l; /2, ■ • • , fm) £ (/l, /2, ■ ■ ■ , /mi fm+l)- 

Thus we can construct an infinite (nonstationary) ascending chain of ideals 
(/i)£(/i,/ 2 )£(/i,/ 2 ,/ 3 )£---, 

in direct contradiction to our hypothesis. 
(2^3): 

Suppose there is a nonempty set T of ideals of R without a maximal element. 
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For each I\ G I there is an I 2 G T with I± C, I 2 . In this way one can 
construct a nonstationary ascending chain of ideals: 

h £ h £■■■£/„£■■■ , 

contradicting the hypothesis. 
(3^2): 

Apply the maximal condition to the set of ideals in a chain of ideals to 
obtain an I na , maximal among the ideals (under inclusion). Thus for all 
n > n , I no gL I n , i.e., I no = I n . □ 

Theorem 2.3.7 (Hilbert's Basis Theorem) If R is a Noetherian ring, 

so is R[x] . 

PROOF. 

Assume that R is Noetherian, but R[x] is not. We shall derive a contra- 
diction. 

Then R[x] must contain an ideal /, which is not finitely generated. Let 
/1 G I be a polynomial of least degree. If f k (k > 1) has already been 
chosen, choose fk+i, the polynomial of least degree in I\ f 2l ■ ■ ■ , /fe). 
Since / is not finitely generated such a sequence of choices can be carried 
on. 

Let n k = deg(/fc) and G R, the leading coefficient of fk (k = 1,2,...). 
Observe that 

• m < n<i < ■ ■ ■, simply by the choice of /^'s; 

• (ai) C (ai, a 2 ) C • • • (ai, a 2 , . . . , a fe ) C (ai, a 2 , . . . , a k , a k +i) C • • • is a 
chain of ideals that must become stationary, as R is Noetherian. That 
is, for some k, (ai,a 2 , ■ ■ ■ ,a k ) = (ai,a 2 , . . . , a k , a k +i), and a k+ i = 
b\a\ + b 2 a 2 H b k a k , b { G R. 

Now consider the polynomial 

9 = fk+i - hx^+^h b k x n «+i- n «f k . 

Notice that (1) deg 5 < deg/ fe+ i, (2) gel and (3) g (f u f 2 , . . . , f k ). 
[Otherwise, it would imply that f k+ i G / 2 , . . . , /fe).] But this contra- 
dicts our choice of f k+ \ as a least-degree polynomial in I \ (/1, / 2 , . . . , f k ). 
□ 

Corollary 2.3.8 

1. If R is a Noetherian ring, so is every polynomial ring R[x x , x 2 , . . . , x n \ . 

2. Let R be a Noetherian ring and S an extension ring of R that is 
finitely generated over R, in the ring sense. (S is a homomorphic 
image of a polynomial ring R[x±, . . . , x n ].) Then S is Noetherian. 

3. For any field K, K [x\, x 2l . . . , x n ] is a Noetherian ring. □ 
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2.3.3 Finite Grobner Bases 

Theorem 2.3.9 Let S be a Noetherian ring. Then every ideal of R = 

S[x\, x 2 , ■ ■ ., x n ] has a finite Grobner basis. 

PROOF. 

Since S is Noetherian, by Hilbert's basis theorem, so is R — S[xi, 
X2, ■ ■ ., x n ]- Let < be an arbitrary but fixed admissible ordering on 

A 

PP(xi,x 2 , ■ ■ ■ ,x n ). 

Let I be an ideal in R, and Head(i), the monomial ideal generated by 
the head monomials of the polynomials in /. Let us choose a polynomial 
gi £ I; if G\ = {gi} C / is not a Grobner basis of /, then Head(Gi) £ 
Head(I), and there is a polynomial g 2 & I such that Hmono(g2) € Head(/)\ 
Head(Gi). Clearly, G 2 - {51,32} Q I and Hcad(Gi) £ Hcad(G 2 ). 

In the (A: + l) th step, assume that we have chosen a set Gk = {<?i, <?2, ■ • 
gt} C /. Now, if Gfe is not a Grobner basis for /, then there is a gu+i € / 
such that 

Hmono(gfc + i) £ Head(J) \ Head(Gfe), 

and Gfe + i = G^ U {gk+i} C / and Hcad(Gfe) ^ Head(Gfe + i). But, since R 
is Noetherian, it cannot have a nonstationary ascending chain of ideals 

Hcad(G!) £ Hcad(G 2 ) £ • • • £ Hcad(G fe ) £ • • • , 

and there is some n > 1 such that Head(G„) = Head(J). But since G n C I, 
we see that G n = {gi,g 2 , ...,</„}■ is a finite Grobner basis for / with respect 
to the admissible ordering < . □ 

A 

2.4 Modules and Syzygies 

Definition 2.4.1 (Modules) Given a ring S, an Abelian group M, and 
a mapping 

H : S xM -> M 
: (s, x) > sx, 

we say M is an S-module if, for all s, t £ S and x, y £ M, the following 
axioms are satisfied: 

s(x + y) = sx + sy, 
(s + t)x = sx + tx, 
(st)x = s(tx), 
lx = x. 

Thus, an S'-module is an additive Abelian group M on which the ring S 
acts linearly. □ 
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If S is a field K, then a if -module is said to be a K- vector space. 

Note that if S is any ring, then any ideal I C S is an S-modulc. In 
particular, S itself is an S-module. Also, every Abelian group (G, +, 0) 
is a Z-module: here, the mapping (n, x) i— » nx (n € Z, x e G) has the 
following outcome: 



nx = < 



X + X + • • • + X 



if n > 0; 
if n = 0; 



(-x) + (-x) H h (-x) if n < 0. 



Let S {0} be a ring, T C 5, a multiplicatively closed subset and M, 
an S'-module. Consider the following equivalence relation "<~" on M x T: 



(V (xi,ai),(x 2 ,a 2 ) e M x T 

(xi, oi) — (x 2 , a 2 ) iff (3 a 3 g T) [a 3 (a 2 xi - x 2 a±) = 0] 



Let Mt — M x T / ~ be the set of equivalence classes on M x T with 
respect to the equivalence relation ~. The equivalence class containing 
(x, a) is denoted by x/a. Mt can be made into an S^-module with the 
obvious definitions of addition and scalar multiplication. Mt is called the 
module of fractions of M with denominator set T. 

Definition 2.4.2 (Module Homomorphisms) Let S be a ring and let 
M and N be S-modules. Then a mapping 



</>:M—>N 



is said to be an S-module homomorphism if, for all s g S and x, y G M, 

<p(x + y) = 4>{x) + <f)(y) and cj)(sx) = scf>(x), 

i.e., S acts linearly with respect to (p. 

Let 4> be an S'-module homomorphism as before. We define the kernel 
of 4> to be 

ker <j) — {x e M : (f>(x) = 0} 
and the image of 4> to be 

im</> = {>(x) eJV : i£ M}. 

It can be verified that ker0 and im^ are both S-modules. □ 
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Definition 2.4.3 (Submodule) Let 5 be a ring and M an S-modulc. 
Then M' is a said to be a submodule of M if M' is a subgroup of M and 
M' is an S- module, i.e., M' is closed under multiplication by the elements 
of S. □ 



Definition 2.4.4 (Quotient Submodule) Given S, M and M' as in the 

previous definition (Definition 2.4.3), we make the quotient Abelian group 
M/M' an S-module by allowing it to inherit an S-module structure in a 
natural manner. In particular, we make the natural definition for multipli- 
cation in M/M 1 : for s e S and x € M, 

s(x + M') = sx + M'. 

This definition is consistent, since, if x + M' = y + M', i.e., x — y € M' 
then s(x — y) = sx — sy e M' and sx + M' = sy + M' . The axioms for an 
S'-module (as in Definition 2.4.1) follow quite easily. 

The S'-module M/M' thus defined is called the quotient submodule of 
M by M', and the mapping 

: M°4°M/M' 
: ihi|M', 

is a surjective S'-module homomorphism. D 



Definition 2.4.5 (Module Operations) Let S be a ring, I be an ideal 
of S, M be an S'-module and 

M = (Mi) ieI , 
a family of submodulcs of M; then: 

1. Sum: 

E M _ I _ Xi e M i; and all but finitely I 

1 | ^ 1 ' many of Xi's are zero | 
iei Kiel ) 

is a submodule of M. Thus M consists of all sums formed by 
taking exactly one element from each submodule in a finite subfamily 
of M. 

2. Intersection: 

iei 

is a submodule of M. 
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3. Product: 

n 

IM = l^a^Xi : di £ I,Xi £ M and n £ zj, 

i=l 

is a submodule of M. 

4. Quotient: Let AT and P be two submodules of M: 

N:P={aeS:aPCN}, 

is an ideal of S. The quotient : M is an ideal and is called the 
annihilator of M (denoted annM). 

An 5-module M is faithful if ann M = 0. □ 



Definition 2.4.6 (Generators) Let 5 be a ring and let M be an S- 
module. Note that, for any x £ M, 

& = {si : s e 5} 

is a submodule of M. Let 

= {xi} ie i 

be a (possibly, infinite) subset of M. X is said to be a system of generators 
of M, if 

iei 

Equivalently, A" is a system of generators of M if every element x £ M can 
be expressed in the form 

ieJ 

where J C / is a finite subset of the index set and Si £ S and ^ € ^. □ 

If S 1 is a ring and M is an 5-module, M is said to be finitely generated if 
M has a finite set of generators, and cyclic (or monogenic) if it is generated 
by only one element. A system of generators u\, . . ., u n of an S'-module M 
is a basis of M, if 



^ aiUi = =^> *) 



a, = 



i.e., M has a linearly independent system of generators. M is called free 
(of rank n) if it has a basis of size n. 
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If S is a ring, then it is natural to make S n into an S'-module, M, by 
defining, for any (s\, . . ., s n ), (t\, . . ., t n ) G S n and any s e 5, 

= (0,...,0), 

(si,...,s n > + t„) = (si +t 1 ,...,s n + t n ) and 

s(*i,...,t„) = (sti, . . . ,st n ). 

It is easy to see that M = S n is a free S'-module of rank n. 

Definition 2.4.7 (Noetherian Modules) An S'-module M is called Noe- 
therian if every submodule N of M is finitely generated. □ 



Proposition 2.4.1 If S is a Noetherian ring, then S n (n < oo) is a 

Noetherian S-module. 

PROOF. 

Let N be a submodule of S n . We proceed by induction on n: If n = 1, then 
there is nothing to prove, since, in this case, N C S 11 = S* is a submodule 
and hence an ideal in S*, thus possessing a finite set of generators. If n > 1, 
then let 

/ = [s e S : (3 s 2 ,.. .,s„ e S) [(s,s 2 ,...,s„) € A]}. 

/ is clearly an ideal in 5*, so / has a finite set of generators {si,i,...,si,fc}. 
Pick si, . . .,~s~k £ N such that for i = 1, . . ., k, si has as first component 
Sij. For an arbitrary element s = (si, S2, ■ • ■, s n ) € iV, we can express Si 
as 

fe 

si = y^rjSi.j, for some n, . . . ,r k e 5. 

Now, note that 

k 

s' = s -^2,nsi 

i=l 

is of the form (0, s%, ■ ■ ., s^) <E iV', where iV' is the following submodule of 
5": 

N' = {s= (0,82,..., s n ) :seN}. 

But the mapping 

</> : N' —* S"- 1 

: (0, S2,.-.,s n ) i ^ (s 2 ,...,s„) 

is a homomorphism with the kernel (0, 0, . . ., 0). Thus <f) is an isomorphism 
of N' into its image im0 C Hence, by induction, im0 a submodule 
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of S n 1 has a finite system of generators, and so does N'. Let {ti, . . ., U} 
be such a system of generators. Then, since 

k I 

s' = s- r{s~ = ^ Qiti, where q u n E S 
i=i i=i 

and 

k I 

s = ^ + g^, 

i=l i=l 

{sT, . . ., Sfc, ti, . . ., i;} is a finite system of generators of N. □ 

Definition 2.4.8 (Syzygies) Let S be a ring and let M — (xi, . . ., x q ) 
be a finitely generated ^-module. Note that 

4> : S q -+M 

: (Sl, . . . , S q ) h-> SiXi H h s g x 9 

is an S'-module homomorphism. Thus 

K = kcrcf) = {(si, . . . ,s q ) e S q I sixi + . . . + s ? a; g = 0}, 

is a submodule of S q ; K is said to be the {first module of) syzygies of M 
[with respect to the system of generators {xi, . . ., x q } of M] and is denoted 
S(M). □ 

Proposition 2.4.2 If S is a Noetherian ring and M is a finitely generated 
S-module, then S(M), the syzygy of M , is finitely generated. 

PROOF. 

If M — (xi, x 2 , ■ ■ ., x q ), then S(M) is a submodule of a Noetherian S'- 
module, S q . Thus, by Proposition 2.4.1, S(M), the syzygy of M is also 
finitely generated. □ 

Given S, a Noetherian ring, and M = (xi, . . ., x q ), a finitely generated 
S-module, we have S(M) = (s7, . . ., s^) where 

51 = (Sl,l,Sl,2, • • • ,Sl, q ) 

52 = ($2,1,52,2, • • • ,S2,q) 

Sp — ($p,l, Sp : 2: • ■ ■ , Sp,q) 

and, for i = 1, . . ., p 

Si,lXl H h Si^ q X q = 0. 
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If U1X1 + U2X2 + • • • + u q x q = 0, then (tii, U2, ■ ■ ., u q ) = u £ S(M); so 
there are «i , «2 , • ■ • , v p £ S such that 

tt = wisT + U2S2 H h v P V 

This is equivalent to the following: 

Ul = flSl,l + U2S2.1 H hWpSp,l 

U 2 = V!S lt 2 + V 2 S 2 ,2 H h V p S Pt 2 

Uq = V\Sl iq + V 2 S 2 .q H +VpS p , q . 



2.5 ^-Polynomials 

Definition 2.5.1 (S- Polynomials) Let S be a ring; R — S[xi, . . ., x n ] 
be a ring of polynomials over S; G C i? be a finite subset; and 

F = {/i,...,/,}CG. 

Set 

T = {*! = Hcoef(/i), ...,t q = Hcocf(/ 9 )} C S 

and 

J = (ti, . . . ,t q ), 

the ideal in S generated by T. From the previous discussion, we can write 
the syzygy of J, S(J) C S* 9 , by its system of generators, {sT, . . ., s^}, where 
s7 £ S q , i = 1, . . ., p. That is, 

S(J) = ( (si,l,Sl, 2 ,...,Sl, 9 ), 



( s p,li s p,2j ■ • ■ 7 s p,<j) J • 

We define the set of S-polynomials of F, which we denote by SP(F), to be 
the set {/ii,f, . . ., h Pt p}, where, for i = 1, . . ., p, 

_ SM ■ Hterm(/i) ' h + ' ' ' + S< ' 9 ' Hterm(/ g ) ' /<? ' 
where m = LCM(Hterm(/i), . . . , Hterm(/,)). □ 
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Note that if wc simplify the expression for an ^-polynomial, say hi t F, 
we get 

TO 

hi,F = i^i • m + Si,i • — — --Tail(/i)H 

Hterm(/i) 

in 

+ S ^' m + S --Ht^n^' Tail(/ « ) 

= (s»,iti H h Si iq t q )m 

+ ^ ' ffi£(A) • TaU(/l) + • • • + • idnC/-) ' Tail(/ ^ 



77Z 777- 

s- 1 ■ hma) ' Tail(/l) + ' ' ' + «•« ' ' Tail(/ ^ 



Thus, Hterm(ft iiF ) < m = LCM^Hterm(/ l ) : G F) . 

Theorem 2.5.1 Let F = {f\, . . ., f q } be as in the previous definition. 
Assume that for some u\, . . ., u q G S, p\, . . ., p q G PP(xi, . . x n ), 

Q 

h = ^2uiPifi, 
»=i 

and that p\ Hterm(/i) = • • • = p 9 Hterm(/ g ) = M > Hterm(/i). 
Thus, 

q 

^ WiPi Hmono(/i) = 0. 

i=l 

Then we can express h in terms of the S'-polynomialso/ the set F as follows: 

p 

h = ^2v jtF r F h jiF , 

where vj t p € S, rp G PP(xi 7 . . ., x n ) and hj : p's are S'-polynomialso/ F, 
SP(F). Furthermore, 

rF Hterm^/ij^ < M = Pi Hterm(/j). 

PROOF. 

Let 

ti = Hcoef(/i), . . . ,t q = Hcoef(/ 9 ) and J = (ti, . . . ,t q ). 
Let a system of generators for the syzygy of J, S(J), be given as follows: 



S(J) = ( (si,l,Sl,2,---,Sl, 9 ), 

(•Sp,l, s p,2, ■ • ■ , s p,<j) ^ • 
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Since 

i i 
^ UiPi Hmono(/,) = M ■ ^ = 0, 

i=l i=l 

there exist vi, . . ., u p G 5 such that 

p p 

Ul = E v o s i,^ ■■■> u i = E v o s m- 
i=i j=i 

Now, if we let m - LCM(Hterm(/ 1 ), . . . , Hterm(/ 9 )) , then it is clear that 

m | pjHterm(/j), for all i, 
i.e., there is some power product r € PP(xi, . . ., x„) such that 

m • r = M = pi ■ Hterm(/i), for all i. 
Thus, we can rewrite h as follows: 



p 



h = ^ v iSi l I r ■ nrr ■ fi + 

I ^ 3 J ' I Htcrm(/i) J 



vi =1 



E^-J^Hterm^)-^ 

= uir ■ fti,F H \-v p r-h PtF - 

Also, for all i, Hterm(r) Hterm(/ij i f) < r ■ m = pi Hterm(/j). □ 



Definition 2.5.2 (Syzygy Condition for a Finite Set G) Let S be a 

ring, and let 

G = {51,52, • • -,5m} C S[xi, . . . ,x„], 

be a finite set of polynomials. We say G satisfies the syzygy condition if 
for every subset F C G and every S'-polynomial /i e SP(F), h can be 
expressed as 



where fi e 5[xi, . . . , x„] and Hterm(/i) > Hterm(/j) Hterm(t/j). □ 



Now we are ready to give a new characterization of a Grobner basis in 
terms of the syzygy condition: 
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Theorem 2.5.2 Let I be an ideal in R — S[xi, . . ., x n ], 

G = {gi, ■ ■ -,g m } C /, 
a finite subset of I. Then the following three statements are equivalent: 
1. Hcad(G) = Head(J). 



2. 



(v / e /) [/ = E 



fi9i 



, where 



fi G S[xi, . . ., x n ] and Hterm(/) > Hterm(/j) Hterm(grj), for all i. 

A 

3. (G) = I and G satisfies the syzygy condition. 
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PROOF. 
(1*2): 

The equivalence of (1) and (2) is simply Theorem 2.3.3. 
(2=^3): 

Note first that (2) establishes G as a Grobner basis (by Theorem 2.3.3) and 
that (G) = I (by Theorem 2.3.1). Furthermore, if F C G and h e S*P(F), 
then h £ I; thus by condition (2) itself, 

ft = XI ^ 5i > 

where fa € 5[xi, . . . , x„] and Htcrm(/i) > Hterm(/j) Hterm^). But this is 

A 

precisely the syzygy condition. 
(3=>2): 

We assume to the contrary, i.e., condition (3) holds, but not (2); we shall 
derive a contradiction. 

We first define height of a sequence 

H = (hi : 1 < i < m, and hi € 

(with respect to G and the admissible ordering < ) as 

A 

Height(i7) = max |Hterm(/ij) Hterm(<7j) : 1 < i < mj. 

A 

Since by (3), (G) = /, every f £ I can be expressed as 

y^figu fi e S[xi, . . . 

i=l 

Furthermore, since (2) is assumed to be false, we can choose an / € / such 
that 

Height((/i,...,/ m )) >Hterm(/). 

We assume that the representation of / is so chosen that it is of a minimal 
height, M. Let 

F = {j,eG: Hterm(/j) Htcrm(.g t ) = m}. 

Without loss of generality, we may assume that F consists of the first k 
elements of G. Thus 

k km 

/ = ^Hmono(.f l )5* + X Tail (/^+ £ fi9u 

i—1 i—1 i— 
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where 

Hterm(/i) Hterm(fiij) = M, l<i<k, 
Hterm(Tail(/ l ))Hterm(3i) < M, 1 < i < k and 

A 

Hterm(/ i ) Hterm(#j) < M, k+l<i<m. 

A 

Since / < M, we see that 

A 

Hterm(/i)Hterm(gi) = ••• = Hterm(/ fe ) Hterrn(# fe ) 

k 

= M > Htermfy^ Hmono(/j)gi) . 
A i=i 

As noted before (Theorem 2.5.1), we see that the expression ^2 i=1 Hmono(f i )g i 
can be expressed in terms of SP(F), the S-polynomials of F. Thus we may 
write / as 

k m 

/ = ^ Vj,F r F h jtF + ^2 Tail (/i)5i + X! 

j i—l i—k+1 

But since rp Hterm(/i Ji ir) < Hterm(/j) Hterm(^) = M, 1 < i < k, 

A 

and since, by the syzygy condition, each 

hj.F = ^2 9% 

i 

with Hterm(/i Ji ^) > Htern^/^.p) Hterm(^), we get 

A 

k m 

f = ( v i,F TF 9i + Yl Tail(fi)gi + ^ f l g i . 

j i i=l i=fe+l 

But, by the previous arguments, 

Htcrm^w^i? rp fij,F^ Hterm(gi) < Htcrm(ri?) Hterm(/j i j t p) Hterm(gj) 

< Htcrm(rF) Hterm(/i JiF ) < M, 

A ' A 

and hence, we can express / differently as 

i=l 

with Height ((/{, . . . , f' m )) < M, and thus contradicting our choice of the 

A 

representation of / relative to G. □ 
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Problems 

Problem 2.1 

Show the following: 

(i) An element r G R of the ring R is a unit if and only if (r) = (1). 

(ii) If r is a unit and x is nilpotent in the ring R, then r + x is again a 
unit. 

Problem 2.2 

Prove that if I\ and I2 , as well as I\ and ^3 are coprime, then I\ and 
I2 I3 are coprime. Hence, if I\, I 2 , ■ ■ • , are pairwise coprime, then 



Problem 2.3 

Let Ji , I2 C ii be two ideals with bases B\ and B 2 , respectively. Which 
of the following statements arc true? Justify your answers. 

(i) -Bi U B 2 is a basis for ii + i 2 . 

(ii) {bi ■ b 2 : ^1 € -Si and &2 G ^2} is a basis for iii 2 . 

(iii) {6^612 ■■■&»„ : bi!,bi 2 i ■ ■ ■ i G i?i} is a basis for If. 

(iv) Let f £ R, and Ci, a basis for ii n (/). Then, every ci G Ci is 
divisible by /, and {ci// : Ci G Ci} is a basis for (ii : (/)). 

Problem 2.4 

Let I, J, if be ideals in a ring R. Prove the following: 

(i) Modular law: It I D J or ID K, then in (J + K) = (IC\J) + (Ir\K). 

(ii) I C I : J, and (i : J) J C i. 

(iii) (0, h ■ J) — Hiili ■■ J), and (i : J,) - QiU = ■*)■ 

(iv) VT D I, and = \/i. 

Problem 2.5 

For each fc = 1, . . . , n, we define a function Uk as follows: 

[4 : PP(ii,i 2 ,...,s n )->R 

: p = x^x^ 2 • • • a:£™ h-» u^iai + u fcj2 a2 H + u kiU a n , 

where each of Uk,i (k = 1, . . . , n, I = 1, . . . , n) is a nonnegative real number. 
Assume that C/fe's are so chosen that (U\(p), . . . , U n (p)) = (0, . . . , 0) if and 
only if p = 1. Wc define an ordering > on the power products as follows: 



given two power products p, q G PP(xi,x 2 , ■ ■ ■ , x n ), we say p > q, if the 
first nonzero entry of 



ii n h n • • • n i, 



u 



<J7i(p),C^(p) 



,...,^n(p)>-<^l(g),^2(?),...,^n(«)> 
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is positive. 

Show that the ordering > is an admissible ordering. Characterize the 

u 

admissible orderings > , > and > in terms of appropriately chosen 

LEX TLEX TRLEX 

functions Uk- 



Problem 2.6 

Let (x 2 + y 2 ), (xy) C Q[s~\, rx] be two ideals with Grobner bases {x 2 + 
y 2 } and {xy}, respectively, with respect to the > (with x > y). Is 

TRLEX TRLEX 

{x 2 + y 2 ,xy} a Grobner basis for (x 2 + y 2 ) + (xy) under > ? 

TRLEX 

Problem 2.7 

Let < be a fixed but arbitrary admissible ordering on PP(xi, . . ., x n ). 

A 

Consider the following procedure (possibly, nonterminating) to compute a 
basis for an ideal in the polynomial ring R = S[x\, . . . , x n ]\ 

G := {0}; 

while (G) / I loop 

Choose f € I \ (G) , such that Hmono(/) is the smallest 
among all such elements with respect to < ; 

G:=GU{/}; 
end{loop } 

Which of the following statements are true? Justify your answers. 

(i) The procedure terminates if S is Noetherian. 

(ii) The procedure is an effective algorithm. 

(iii) Let fi be the element of I that is added to G in the i th iteration. 
Then Hterm(/i) < Hterm(/ 2 ) < • • • < Hterm(/„) < • • • 

A A A A 

(iv) The set G at the termination is a Grobner basis for /. 
Problem 2.8 

Consider the polynomial ring R = S[x\, . . ., x n ], with total reverse 
lexicographic admissible ordering > such that x\ > ■■■ > x n . 

TRLEX TRLEX TRLEX 

The homogeneous part of a polynomial / € R of degree d (denoted fd) is 
simply the sum of all the monomials of degree d in /. An ideal / C R is 
said to be homogeneous if the following condition holds: f E I implies that 
for all d > 0, f d e I 

Prove that: If G is a Grobner basis for a homogeneous ideal I with 
respect to > in R, then G U {xi, . . . , x n } is a Grobner basis for (I, x i: 

TRLEX 

■ ■ ■, x n ) (1 < i < n) with respect to > in R. 

TRLEX 

Hint: First show that (Head(Z), Xi, . . . , x n ) — Head(7, Xi, . . . , x n ), 
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Problem 2.9 

Let if be a field and I an ideal in K[x\ , #2, . . ., x n ] . Let G be a maximal 
subset of I satisfying the following two conditions: 

1. All / e G are monic, i.e., Hcoef(/) = 1. 

2. For all / e G and 5 <E I, Hterm(/) 7^ Htcrm(<7) implies that Hterm(g) 
does not divide Hterm(/). 

(i) Show that G is finite. 

(ii) Prove that G is a Grobner basis for I. 

(iii) A Grobner basis G m i n for an ideal I is a minimal Grobner basis 
for /, if no proper subset of G m i n is a Grobner basis for /. 

Show that G (defined earlier) is a minimal Grobner basis for I. 

(iv) Let G be a Grobner basis (not necessarily finite) for an ideal I of 
K[xi,X2, • ■ • , x n ]. Then there is a finite minimal Grobner basis G'CG for 
I. 



Problem 2.10 

(i) Given as input ao, ai, . . ., a n and 60, 61, . . ., b n integers, write an 
algorithm that computes all the coefficients of 

(ao + a\x -\ h a n x n )(b + 61a; H h b n x n ). 

Your algorithm should work in 0(n 2 ) operations. 

(ii) Show that given M and v, two positive integers, there is a unique 
polynomial 

p(x) = po+ pix H h p n x n 

satisfying the following two conditions: 

1. po, Pi, ■ ■ p n & re a U integers in the range [0, M — 1], and 

2. p(M) = v 

(iii) Devise an algorithm that on input ao, 01, . . ., a n ; M integers 
evaluates 

a + a 1 M H h a n M n . 

Your algorithm should work in 0(n) operations. 

(iv) Use (ii) and (iii) to develop an algorithm for polynomial multi- 
plication [as in (i)] that uses 0(n) arithmetic operations (i.e., additions, 
multiplications, divisions, etc.). 

(v) Comment on your algorithm for (iv). What is the bit-complexity 
of your algorithm? 
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Solutions to Selected Problems 

Problem 2.3 

All but the last statement are true. Since the proofs for the first three 
assertions are fairly simple, we shall concentrate on the last statement. 

(iv) Here is a counterexample (due to Giovanni Gallo of University of 
Catania): 

Let R = Q[V\ rv]/(rv^) be a ring, / = y 2 G R and I\ = (xy), an ideal 
in R. Since xy 2 G (y 2 ) and xy 2 G (xy), clearly, xy 2 Elf) (/). Conversely, 
if a G / fl (/), then a must be r • xy 2 for some r G R. Hence, a G (xy 2 ), 
and (xy 2 ) = /n(/). 

On the other hand, as, for all k > 3, 

/•/E^EOmod (y 5 ), 

y fc G / : (/). But, for any k > Q, y k ^ (x). Therefore, {x} is not a basis for 
/ : (/). In fact, for this example, {x, y 3 } is a basis for I : (/) = (xy) : (y 2 ), 
for the following reasons: If a G (xy) : (y 2 ), then the following statements 
are all equivalent. 

ay 2 G (xy) ay 2 G (xy) n (y 2 ) = (xy 2 ) 

ay 2 = rxy 2 mod (y 5 ), r G i? 

<^ (a - rx)y 2 = sy 5 , s G i? 

<=> a = rx + sy , r,sGR. 

However, if / G R is not a nonzero zero divisor, or if R is an integral 
domain, then the statement holds. Let s^Oel: (/). Then af G /, and 
a/ 7^ 0. Thus, a/ = X) c ec r i c i anc ^ a = J2c-ec r i°i/ f ex ists. Hence, a G 
({c/f : c G C}) . As, for all a G C, c;/ '/ G J : (/), we see that {c// : c G C} 
is a basis for J : (/). 

In general, the following holds: 

Let f G R, C be a basis for Id(f), and D be a basis for ann (/). 
Then, every c G C is divisible by f , and {c/f : c G C} U D is a 
basis for I : (/). 

af el & afeln (f) = (C) 

r lCi = 0, n G R 

c % ec 
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« «e I ce cjuD 

Thus {c// : c e C} U D is a basis for I : (/). 

Problem 2.7 

Let Gi denote {/i, . . ., ft}. 

(i) True. The termination of the procedure directly follows from the 
ascending chain condition (ACC) property of the Noetherian ring as 

(Gi)£(G 2 )£ (G 3 )£-". 



(ii) False. This algorithm is not effective because it does not say how 
to find/e/\(G). 

(iii) True. The condition Hterm(/i) < Hterm(/2) < • • ■ follows immedi- 

A A 

ately from the fact that, at each step, we choose an fi such that Hmono(/i) 
is the smallest among all elements of 7\ (Gj_i). Indeed, if it were not true, 
then there would be i and j (j < i) such that 

Hterm(/j) < Hterm(/j). 

A 

Assume that among all such elements fi is the first element which violates 
the property. But, since (Gj_i) C (Gj_i), and since fi & I \ (Gj_i), we 
have ft G I \ (Gj-i). Since, Hterm(/j) < Hterm(/j), the algorithm would 

A 

have chosen fi in the j th step, instead of fj, contradicting the hypothesis. 

(iv) False. Consider the ideal I — (xy, x 2 + y 2 ) C Q[r\, rv]. Let the 
first polynomial chosen be /i — xy e I \ (0) as Hmono(/i) is the smallest 
among all such elments (with respect to < )• Similarly let the second 

TRLEX 

polynomial chosen be fi = x 2 + y 2 e I \ (xy) as Hmono(/2) is the smallest 
among all such elments (with respect to < ). As I = (/i, / 2 ), the 

TRLEX 

algorithm terminates with G = {/i, / 2 }. Thus Hcad(G) = (xy, x 2 ). Since 
y 3 = y(x 2 + y 2 ) — x(xy) e I, we have y 3 e Head(7) \ Head(G), and G is 
not a Grobner basis of I. 

Problem 2.8 

Since G C 7, we have (G, Xi, . . ., x n ) C (7, Xi, . . ., x n ) and Head(G, x i} 
■ ■ ■, x n ) C Hcad(7, Xi, . . ., x n ). Hence, we only need to show that Head(G, 
Xi, ■ ■ ., Xn) 3 Head(7, Xi, . . ., x n ). 

Following the hint, we proceed as follows: Let fj e (7, Xi, . . ., x n ) be a 
homogeneous polynomial of degree d. Then, if Hmono(/d) £ (xi, . . ., x n ), 
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then, plainly, Hmono(/d) G (Hcad(J), Xi, . . ., x n ). Otherwise, Hmono(/d) 
is not divisible by any of the Xj's and fd can be expressed as follows: 

fd = 9d + hiXi H h h n x n , g d G /, and hi,...,h n € S[xi, . . . , x n \. 

Since Hmono(/d) G PP(xi, . . ., x n ), by the choice of our admissible 
ordering, Hmono(/d) > Hmono(/i,i,), and thus 

TRLEX 

Hmono(/d) = Hmono(<7d) € Head(I) C (Head(I), Xi, . . . , x n ). 

This proves that 

(Head(7),a; i , . . . ,x n ) D Head(7, x i; . . . , x n ) 

D (B.ea,d(I),Xi,...,x n ). 

Since Head(G, Xi, . . ., x n ) D (Hcad(G), x i} . . ., x n ), using the hint, we 
have 

Head(G,Xi, . . . ,x n ) 

D (E.e&d(G),Xi,...,x n ) 

= (Head(I), Xi, . . . , x n ) (G is a Grobner basis for I) 

D Head(7, Xi, . . . ,x n ), 

as claimed. 



Problem 2.9 

Since K is a field, for all a G K , a -1 G K . Hence, 

(V / G /) (3 /' G /) Hterm(/) = Hterm(/'), but Hcoef(/') = 1 

Moreover, for all a G K, a f G /. 

(i) Let the set of power products, G, be the set of head terms of G: 

G = {Hterm(/) : / G G}. 

Let Htcrm(/) and Htcrm(g) be two distinct power products in G. Then 
/ G G C / and g G G C /. Thus, by condition (2), Hterm(/) is not a 
multiple of Htcrm(g), nor the converse. But, then by Dickson's lemma, G 
is finite, and so is G, since there is a bijective map between G and G. 

(ii) We claim that 

(v/ie/j (3 f G G) Htcrm(/i) = p ■ Hterm(/) 
where p G PP(xi, . . . , x n ). 
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Indeed if it were not true, then we could choose a "smallest" h £ I [i.e., 
the Htcrm(/i) is minimal under the chosen admissible ordering < ] violating 

A 

the above condition. Without loss of generality, we assume that h is monic. 

If g £ I is a polynomial with a distinct Hterm from h such that 
Hterm(g) divides Hterm(/i), then Hterm(g) < Hterm(/i), and by the choice 

A 

of h, Hterm(g) is a multiple of Hterm(/), for some / £ G. But, this con- 
tradicts the assumption that Hterm(/i) is not divisible by Hterm(/), as 
feG. 

Thus, for all g £ I, Hterm(/i) ^ Hterm(g) implies that Hterm(g) does 
not divide Hterm(/i). But, if this is the case, then G U {h} also satisfies 
conditions (1) and (2), which contradicts the maximality of G. 

Now, we see that if h £ I, then 

Hmono(/i) = Hcoef(/i) • Hterm(/i) = Hcoef(/i) • pHterm(/), 

where / £ G, p £ PP(ii, . . . , x n ) 
=> Hmono(ft) £ Head(G). 

Therefore Head(J) C Hcad(G); hence G is a Grobner basis for /. 

(iii) Suppose G is not a minimal Grobner basis for /. Let G' £ G be 
a minimal Grobner basis. Let / £ G\G' . By the construction of G, for 
all g £ G' C I, Hterm(<7) does not divide Hterm(/). Thus Hterm(/) £ 
Head(G) \ Hcad(G'). This leads us to the conclusion that 

Hcad(G') ^ Hcad(G) = Head(7). 

Thus G' is not a Grobner basis, as assumed. 

(iv) Let G' C G be a minimal Grobner basis for I. Without loss of 
generality, assume that all the polynomials of G' are monic. Let /, g £ G 
with distinct Hterm's. We claim that Hterm(/) does not divide Hterm(g). 
Since, otherwise, 

Head(J) = Head(G') = Head (G' \ {g}) , 

and, G' \ {g} £ G would be a Grobner basis for I, contradicting the 
minimality of G'. Thus, by Dickson's lemma the set of power products, 

G 7 = {Hterm( ff ) | g £ G'}, 

is finite and so is G' . 

Problem 2.10 

(i) Let C = {c 2 „, C2 n -i, . . ., Co} be the coefficients of 

(a n x n H h a\x + a ) ■ (b n x n H h b\x + b ). 
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Then 

i 

Vj,i<j<2n c i = X! a i ' bi -i> wner e a*, 6 l = 0, for z > n. 

3=0 

Since Cj is a sum of at most n products, it can be computed using 0(n) 
arithmetic operations. Hence C can be computed in 0(n 2 ) time assuming 
that each arithmetic operations takes O(l) time. 

(ii) Let 

p(x) = po +pix H Vp n x n . 

Consider 

n = min { M J > v } — 1 , and 
Pi = Quotient (v, M l ) mod M, < i < n. 
It is obvious that < Pi < M and also 

n n 

p(M) = ^ Pl M i = [Quotient (v, M l ) mod M] ■ M i 

2 = 1=0 

n 

= Y [Quotient (v, M l ) 

z=0 

- Quotient (Quotient (v, M l ) ,M) ■ M]- M i 

n 

= Y [Quotient (v, M*) • M l - Quotient (v, M i+1 ) ■ M i+1 ] 

= Quotient (v,l)- Quotient (v, M n+1 ) ■ M n+1 . 

But M n+1 > v, therefore Quoticnt(>, M n+1 ) = which implies p(M) = v. 

The uniqueness follows from the fact that p = (p„, . . ., po) gives the 
unique representation of v in radix M. Since each pi can be computed 
using O(l) arithmetic operations (using the fact that M l — M 1 ^ 1 ■ M), we 
can find all pi in 0(n) time. 

(iii) We can write the polynomial as 

a + M (oi + • • • + M (o n _i + Ma n ) ■ ■ •) . 

If we compute the above expression from the innermost level to the outer- 
most, we need only 0(n) arithmetic operations and therefore p(M) can be 
computed using 0{n) operations. 

(iv) Let 

A[x) = fl + flis H ha„x™, 

B[x] — b + b\x + • • • + b n x n , and 

C[x] = A[x] ■ B[x\. 
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Assume that 

a = max{o , a\, . . . , a n , b Q ,bi,..., &„}; 
choose M = (n + 2) - a. We claim that 

Cl = Quotient (A[M] ■ B[M],M*) mod M. 

c * = Ej=o a J b i-3 < {n+2)a = M. On the other hand C[M] = A[M]-B[M] 
and therefore from the previous part it follows that there is a unique de- 
composition of C[M], such that it can be written as a polynomial whose 
coefficients are the same as in the above equality. Hence, the above ex- 
pression correctly gives the coefficients. Moreover A[M] and B[M] can be 
computed using 0(n) arithmetic operations, and once we have C[M], we 
can obtain all using 0(n) arithmetic operations which shows that the 
total operations required are bounded by 0(n). 

(v) Since M = (n + 2)o, it needs b — 0(logn + logo) bits. Using 
Schonhagc-Strassen's algorithm, two n-bit integers can be multiplied in 
0(n log n log log n) time. In our cases the largest number that we multiply is 
M n which requires nb bits. Hence the time complexity of one multiplication 
is 0(n&(logn&)(loglogn&)) and since there are 0(n) multiplications, the 
total bit complexity is bounded by 0(ri 2 6(logn&)(loglogn&)). 

Note: {Matrix Multiplication) Given two n x n matrices, A and B, we can 
compute C = A- B, using 0(n 2 ) operations as follows: We can represent A 
as a vector of its rows, i.e., A — {a\, 02, . . ., a n }, where ai — {o»,i, a, ; 2, . . ., 
a it n}. Similarly, we can consider B as a vector of columns, each column 
denoted by bj = {bij, b 2 j, ■ ■ ., b n j}. If we treat ai and bj as polynomials, 
where 

ai(x) = ai,\+ai t 2X-\ h o» in a; n_1 , and 

bj(x) = b\jx n ~ 1 H h b n -ijx + b n j, 

then Cij = Y^k=i a i,kbk,j is nothing but the coefficient of a;" -1 in the 
polynomial a, (a:) -bj(x). Now, to compute C, we proceed as follows, choose 
M = (n + 2) maxi<i.j<„{aij, bij}, and compute a\(M), . . ., a n (M), 
61 (M), b n (M). Then 

c id = Quotient (a 4 (M) • bj(M), M" -1 ) mod M. 

Each of aj(M)'s and bj(M)'s can be computed using 0(n) operations, and 
each Cij can be computed using 0(1) operations. Therefore, the matrix C 
can be computed using 0(n 2 ) arithmetic operations. 
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Kostrikin [120], Matsumura [141, 142], van der Waerden [204] and Zariski and 
Samuel [216]. 

Dickson's lemma for power products first appears in [62]. For homogeneous 
ideals in K[xi, x n ], the concept of a basis was given by Hilbert [95], and 
Hilbert's basis theorem (Hilbert Basissatz) now appears in all textbooks on alge- 
bra, in its dehomogenized version. 

The concept and use of admissible orderings (also called term orderings) 
seems to have first appeared in the work of Buchberger [33] and then was further 
generalized by several authors. First characterization of all possible orderings 
appeared in the work of Robbiano [175, 176], and a more constructive character- 
ization appears in the papers by Dube et al. [64, 65]. 

In 1965, Bruno Buchberger, in his doctoral dissertation [30], introduced the 
concept of a Grobner basis, which Buchberger had named after his teacher, W. 
Grobner. During the last two decades, as the significance of Grobner bases has 
begun to be widely understood, several survey and expository papers on the topic 
have appeared: for instance, Barkee [12], Barkee and Ecks [11], Buchberger [33], 
Mishra and Yap [149], Moller and Mora [151], Mora [154] and Robbiano [177, 178]. 
There have been several generalizations of Buchberger's theory of Grobner bases 
to subalgebras [179], rings with valuations or nitrations [197], noncommutative 
rings [153], free modules over polynomial rings [16], etc. 

Problem 2.5 is based on some results due to Dube et al. [64] and Robbiano 
[175] and Problem 2.8 is due to Bayer and Stillman [17]. 

We conclude this note with the following excerpts from the paper by Moller 
and Mora [151] describing the evolution of the ideas inherent in Grobner bases. 

For homogeneous ideals in P := k[xi, . . ., x n ], the concept of a basis 
was first given by Hilbert, 1890,.... 

Macaulay observed, that any set of polynomials obtained by de- 
homogenization from a basis of a homogeneous ideal is a basis with 
favourable properties. He denoted such sets therefore H-bases [135]. 
Macaulay and subsequently Grobner and Renschuch stressed in their 
work the significance of H-bases for constructions in Ideal Theory and 
Algebraic Geometry.... 

The method of Grobner bases (G-bases) was introduced in 1965 by 
Buchberger and starting from 1976, studied in a sequence of articles 
[33].... 

The concept of standard bases , which is strictly related to that 
one of G-bases was introduced first by Hironaka [96] and studied 
further among others by Galligo [73, 74]. The main difference is 
that it was given for formal power series rings instead of polynomial 
ones (the difference can be easily stated saying that "initial" terms 
for standard bases play the role of the maximal terms for Grobner 
bases) ; the algorithmic problem of constructing such a basis was not 
undertaken until 1981, when it was solved by [Mora [152]] gener- 
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alizing and suitably modifying Buchberger's algorithm for G-bases. 
The strict interrelation between the two concepts (and algorithms to 
compute them) is made clear in [Lazard's work[128]]. 



Chapter 3 

Computational Ideal 
Theory 

3.1 Introduction 

In the previous chapter, we saw that an ideal in a Noetherian ring admits 
a finite Grobner basis (Theorem 2.3.9). However, in order to develop con- 
structive methods that compute a Grobner basis of an ideal, we need to 
endow the underlying ring with certain additional constructive properties. 
Two such properties we consider in detail, are detachability and syzygy- 
solvability. A computable Noetherian ring with such properties will be 
referred to as a strongly computable ring. 

Thus, we will start with the notion of a strongly computable ring, and 
then provide an algorithm that computes a Grobner basis for an ideal in 
S[xi, . . ., x n ], assuming that S is a strongly computable ring. Along the 
way, we shall also develop the concept of a head reduction 1 , which, along 
with the notion of ^-polynomial, will provide the basic ingredients for the 
algorithm. 

Next, we will provide a stronger form of Hilbert's basis theorem: namely, 
we shall see that if S is a strongly computable ring, so is S[x\, . . ., x n ]. 

We will conclude this chapter with a sampling of various applications 
of the Grobner basis algorithm to computational ideal theory. Examples of 
such applications include ideal membership, ideal congruence, ideal equal- 
ity, syzygy basis construction, sum, product, intersection and quotient op- 
erations on ideals. 



'Some authors simply use the term reduction for what we call head reduction here. 
However, we will reserve the term reduction for a slightly stronger process that was first 
introduced and used by Buchberger. 



71 



72 



Computational Ideal Theory 



Chapter 3 



3.2 Strongly Computable Ring 

Definition 3.2.1 (Computable Ring) A ring S is said to be computable 
if for given r, s € S, there are algorithmic procedures to compute — r, r + s, 
r ■ s. If, additionally S is a field, then we assume that for a given nonzero 
field element r e S (r ^ 0), there is an algorithmic procedure to compute 
r-\ □ 



Definition 3.2.2 (Detachable Ring) Let S be a ring, s € S and {si, 
■ • ■, s q } C. S. S is said to be detachable if there is an algorithm to decide 
whether s G (s\, . . ., s g ). If so, the algorithm produces a set {t\, . . ., 
£g} C S, such that 

s = h sH h t 9 s 9 . □ 



Definition 3.2.3 (Syzygy-Solvable Ring) A ring S is said to be syzygy- 
solvable if for any given {s l7 . . ., s q } C S, there is an algorithm to compute 
a (finite) syzygy basis, t\, . . ., t p for the S'-modulc S(s±, . . ., s q ) such that 



1. For all 1 < i < p, J2j U,j s j 



0. 



2. For any (m, .. ., u q ) = u E S q , J^j u j s j 



( 



3v 



,v p )eS p 



v\ ti H \-v p tp 



□ 



Definition 3.2.4 (Strongly Computable Ring) A ring S is said to be 
strongly computable if it satisfies the following four conditions: 

1. S is Noetherian, 

2. 5 is computable, 

3. S is detachable, and 

4. S is syzygy-solvable. □ 

Let S be a ring and R = S[xi, . . ., x n ] be a ring of polynomials. Let 
> be a fixed but arbitrary computable admissible ordering on PP(xi, . . ., 

A 

x n ). Assume that G C R and f £ R. Then the problems of (1) deciding 
whether Hmono(/) € Head(G) and (2) computing the 5-polynomials of G 
reduce to simpler detachability and syzygy solvability problems in the ring 
S, respectively. The following lemma shows the relationship between the 
membership problem for the head monomial ideal and detachability. 
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Lemma 3.2.1 Let G s = gC : Htcrm(.g) | Htcrm(/)|. 
Then Hmono(/) € Head(G) if and only if 

Hcoef(/)e ({Hcoe%): .geG/}). 

PROOF. 

(=>) Since Hmono(/) G Hcad(G), it is possible to write 

Hmono(/) = ^ a,i ■ Vi ■ Hmono(gi), a t G S and G PP(xi, . . . , x n ) 

s.eG 

such that K • Htcrm(gi) = Hterm(/) (i.e., gi G G/ ). Therefore, 

Hmono(/) = ^ ai • Hcoef(gi) • Htcrm(/) 

«eG/ 

Hcoef(/) = ^ ai- Hcoe%i) 
Hcocf(/) G ({Hcoe%) : g G G/}). 

(<S=) Hcoef(/) = ^ aj • Hcoef(5i), Oj G S 

Hmono(/) = Hcoef(/) • Hterm(/) 

= a * Jr—rA ' Hmono(fif i ) G Hcad(G), 
4^ Hterm(.g i ) 



since (y g G Gj 



Hterm(f) ^ , 

me^R ePP ^---'^ 



. □ 



3.2.1 Example: Computable Field 

Most of the commonly used rings do satisfy the requirements to be strongly 
computable. We give two examples in this section: namely, the computable 
fields (e.g., field of rationals, Q or field of integer mod a prime, Z,) and the 
ring of integers, Z. 

Example 3.2.5 (A Computable Field Is Strongly Computable.) Let 

S = K be a computable field. We show that K is strongly computable. 



1. if is Noetherian; recall that a field can have only improper ideals 
which are obviously finitely generated. 
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2. K is computable, by assumption. 

3. K is detachable. Let a £ K and {ai, . . ., a q } C K. If a ^ but 
ai = <i2 = • • • = a q = 0, then a ^ (ai, . . ., a g ); otherwise, assume 
that some ^ and has a multiplicative inverse aT" 1 . Then 

o = 0-ai + -- - + 0- di-i + a ■ a^ 1 ■ a* + ■ ■ ■ + ■ a g . 

4. K is syzygy-solvable. Let {ai, . . ., a 9 } C (without loss of general- 
ity, we may assume that every a, 7^ 0). Then the syzygy of (ai, . . ., 
a q ) is a (q — l)-dimensional vector space, orthogonal to the vector 
(ai, . . ., a g ), with the following basis: 

*i = (arS - fl 2~\ 0, . . • ,0) 
h = (0, aj 1 , -as\...,0) 

t~[ = (0, 0, ...,a~\, -a' 1 ) 

To verify that it is really a syzygy basis, notice that it satisfies both 
conditions of the definition of syzygy solvability. 

The first condition holds, since, for all i, X)|=i ti,j a j = 0- 

U ■ a = • H h aj 1 ■ ai - ■ a i+1 H h • a q 

= 1-1 = 0. 

Let d = (di, . . ., d q ) such that Y^j=i^j a j = 0- Then, in order to 
satisfy the second condition, we need to determine a tuple v = (v\, 
. . ., «<j-i) such that 

d = Vl-h-\ \-Vg-l ■ tg-l. 

We choose v as follows: 

vi — ai ■ di 

v 2 — ai ■ di + a 2 ■ d 2 

Vi = ai • di + a 2 ■ d 2 H h a, • d { 

v q -i = ai ■ di + a 2 ■ d 2 H h a,-i • 

Then the j th component of v\ ■ t\ + ■ ■ ■ + v q -i • is computed as 
follows: 
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• The first component is v\ ■ a 1 1 = a\ ■ d\ ■ a 1 1 = d\. 

• For 1 < j < q, the j th component is 

v j ' + v j-i ' 

= (ai • d\ + h a,j ■ dj) ■ aj 1 

- (ai ■ d\ H h Oj_i • • aj 1 

= aj-dj-aj 1 =dj. 

• Finally, the q th component is 

-Uq-i-a^ 1 = -(ai-diH h a,-i • d,-i) • a," 1 . 

But rfjOj = 0; therefore, — ai -di — • • • — a 9 _i = a g -<i g 

q • dq • CI dq 



and the q th component is simply a q -d q -a 1 = d q . Thus tj's form 



a basis for the syzygy of (ai, . . ., a q ). □ 

S- Polynomials in K[xi, . . ., x n ] 



Lemma 3.2.2 Let G C if [xi, . . ., £„] fee a /mife set of polynomials over 
a field K and let 

* (9j ,9k)- Hmono(gj ) ' 9j Hmono(fffc ) ' 9k 

where gj, g% € G and gj ^ g^, m = LCM(Hterm(gj), Hterm(<7fc)). 

Then G satisfies the syzygy condition if and only if every S(gj, gu) can 
be expressed as 

S{g 3 ,gk) = ^2 fcgi, ft € K[xi, ...x n ] and g t e G, 

where Hterm(S(gj, > Hterm(/j) Htcrm(gi). 

A 

PROOF. 

We need to observe that every S(gj,gk) is an 5-polynomial for the set 
{9j>9k} C G and every 5-polynomial of any subset of G can be expressed 
as a power-product times some S{gj 1 gk)- 
Let {g 1: . . ., g q } C G, and 

|ai = Hcoe%i), . . . , a q = Hcoef(5 9 )| C K. 

In the syzygy basis for (oi, . . ., a q ), each ti has only two nonzero entries; 
namely, a~ l and — a^ 1 . Hence, each S'-polynomial of G has the following 
form: 

M M 
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where M = LCM(Hterm(gi), . . ., Hterm(5 9 )) = p-LCM(Hterm(#,-), Hterm(# fe )), 
which can be written as 



P-S(9j,9k), pePP(n 



,x n ). □ 



In view of the above lemma, we shall often refer to S(gj,gk)'s as 5-polynomials 
while working over a field. 

3.2.2 Example: Ring of Integers 

Example 3.2.6 (Ring of Integers Is Strongly Computable.) Let S = 

Z be the ring of integers. We recall the following useful property of Z: for 
any {a 1} . . ., a q } C Z, 



(a&ez) [() = (^,...,D„) 



and 

1. b= GCD(ai, 



. ., a q ). Let 
, Oi 



02 

6 : 



2. Since fee (oi, . . ., a q ), 

(3d,...,C q € zj 



= f^f + • • • + M D| 



Note that 6, ci, . . ., c 9 can be computed using Euclid's algorithm; the 
details of Euclid's algorithm can be found in Knuth [116]. 
Now, we show that Z is strongly computable. 

1. Z is Noetherian. This follows from the fact that every ideal in Z is 
generated by a single element in Z. (That is, every ideal of Z is a 
principal ideal and thus Z is a principal ideal domain.) 

2. Z is computable. The necessary algorithms are easy to construct. 

3. Z is detachable. Let a € Z, {ai, . . ., a q } C Z and b = GCD(cti, . . ., 
a q ). If b \ a, then a £ (b) = (a\, . . ., a q ); otherwise, a £ (b) = (oi, 
. . ., a q ). Now, if a = d ■ b, then 

a = (d ■ c\)a\ + ••• + (d- c q )a q , 

where obtained from Euclid's algorithm. 

4. Z is syzygy-solvable. Let {a\, . . ., a q } C Z, 

6 = GCD(ai, . . . , a q ) = c\a\ H h c 9 a 9 , 
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and a\ — di/b (for i = 1, . . ., q). Then the syzygy of (oi, . . ., a q ) has 
the following basis: 

ti = ((c2a' 2 -\ \-c q a' q ), -C2a' l5 . . . , -c q a[) 

*2 = (-cia 2 , (c^+^a^ \-c q a q ), -c 3 a 2 , ■ ■ ■ , -c q a' 2 ) 

tq = (-ClflSg, . . . , -C q -ia' q , {c\a\ H \- C q -ia! q -l)) 

To verify that it is really a syzygy basis, we need to show that it 
satisfies both conditions of the definition of syzygy solvability. 

The first condition holds, since, for all i, X^j=i ^i,i a j = 0. 



9 



ci ttj ai Cj_i a, a 4 _i 

+ Ci a'j aj H h Cj_i a-_ x a 4 + c i+ i a' i+1 a t H h^a^o, 



- c J+ i a- dj+i - • • • — c g a- a g 

= "Ci— Cj_i — - — 

O 



aiOi cii-idi Chi+idi a„ai 

— + --- + c i -i—^ + c i+1 — + ...+c q — 

diCli+1 _ aia q 



c i+1 — c q b 



= 0. 



Let d = (di, . . ., d 9 ) such that X^jLi 4? a j = 0- Then, in order to 
satisfy the second condition, we need to determine a tuple v = (v\, 
. . ., v q -i) such that 

d = Vl ■ iH \-v g -i ■ t q -l. 

We show that the choice v = d satisfies the condition, that is, the j th 
component of d\ti H h d q t q is dj itself. The j th component is 

di ti.j + d2 t2,j + • • • + d q t q j 

= — di Cj a'j — d 2 Cj a 2 — ■ ■ ■ — dj-i Cj ci'j_ 1 

+ dj (ci a[-\ h Cj-i a' j _ 1 + c j+ i a' j+1 H + c q a' q ^j 

- d J+1 Cj a' j+1 d qCj a' q 



+ d 



d x ai H h dj_iOj_i + d J+1 a J+1 H h d q a q 

b 

CiOi -I h Cj_iaj_i + Cj + iaj + i j h c,a, ' 

6 
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+ d 



- d 3 a J 

b 

c\a\ + ■ ■ ■ + Cj-idj-i + Cj + \aj + i + ■ ■ ■ + c q a q 
b 

(Using the fact that ^^djdj = 0) 



dj 



cia\ H h c„ar 



b 

= dj. (Since ^ Cjflj = b) 

i 

Thus ti's form a basis for the syzygy of (oi, . . ., a q ). □ 

Remark 3.2.7 There is another syzygy basis for (<zi, . . ., a q ) C Z with 
a somewhat simpler structure. Let {a±, . . ., a q } C Z, b = GCD(ai, . . ., 
a q ) = c\d\ + • • • + c q a q and bij = GCD(aj, aj). Then the syzygy basis for 
(ai, . . ., a q ) can be given as follows: 

nj=(o,...,o, ^- , o,...,o, --p- , o,...,o 

position i position j 

for 1 < i < j < q. 

It now remains to check that both conditions for syzygy-solvability are 
satisfied. 

\ aj ■ ai ai ■ aj 

j l J u% >3 

2. Since ti, . . ., t q is a syzygy basis (i.e., any element u of syzygy can be 
written as a linear combination of ti's), it is enough to show that each 
U can be written as a linear combination of Tij. Note that b \ bij . 
Let b'i j = bij/b. Then 

- ci b' x i i\i — C2 b' 2ji T%i - ■ ■ ■ - Ci-i b / i _ l i r»_i,» 

+ C i+1 nj+l H 1" C q K,q T\q 

-ClJ, 0, 0, Cly, 0, 

0, -c 2 ^, 0, 0, c 2 ^, 0, 



H 

/ ai i 

+ ^o, . . . , o, -a-!—, Ci_i-^-, o, . . . , o 

+ ^0, . . . , 0, Cj+1-^-, -c m y, 0, . . . , 
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+ (0, ...,0, c t+2 ^, 0, -Ci +2 p 0, ...,0) 
+ ••• 

+ (0, ...,0, c q ^, 0,...,0, -c,^) 

= (-cia'i, -c 2 a-, -Ci_ia-, 

(ci ai H h Cj_i a-_j + c l+1 a' i+1 H he, a' g ), 

-c i+ ia-, -Cga'j) 
= U. □ 



^-Polynomials in Z[>rv, . . ., x n ] 



Lemma 3.2.3 Lei G C ZL^ 7 . . x„] be a /mife sef of polynomials over 
the ring of integers Z and Zet 

S( )= — • - — • 
J '' * Hmono(^j) Hmono(<7fc) ^ fc 

where gj, gu <E G, ^ 7^ and fh — LCM{Hmcmo{gj), Hmono(gfc)). 

Then G satisfies the syzygy condition if and only if every S(gj, g^) can 
be expressed as 

S(9j,9k) = ^2fi9i, /.e^K,..^,,] and 9n e G, 

w/iere Hterm(5(<7j, gfc)) > Hterm(/j) Hterm(^j). 

PROOF. 

The proof proceeds in a manner similar to the case for a field if (sec 
Lemma 3.2.2). Let {51, . . ., g q } C G, and 

jai = Hcoe%i), . . . ,a q = Hcoe% g )| C Z. 

We have seen that a basis for the syzygy of (01, . . ., a g ) can be written as 
follows: 



7^=(0,...,0, -i- ,0,...,0, --^,0,...,0 

position j position k 

for 1 < j < k < q. Hence, each 5-polynomial of G has the following form: 
Hcoc%) M 



GCD(Hcoc%), Hcoef( 5fc )) Hterm( 5j ) 

Hcoe% fc ) M 



9j 

M 

9k 



GCD(Hcoe%j), Hcoc% fe )) Htcrm( 5fe ) 
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LCM(Hcocf( g j), Hcocf(g fc ))M _ LCM(Hcocf( gj -), Hcocf(g fc ))M 
Hmonofgj) ^ Hmono(<?fc) 

= P-S(9j,9k), where 

M = LCM(Hterm(5i), . . . , Hterm(g g )) 

= pLCM(Hterm(gj), Hmono(jt)), and p £ PP(a;i, . . . , x n ). □ 

In view of the above lemma, we also refer to S(gj,gk)'s as ^-polynomials 
while working over the integers. 
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3.3 Head Reductions and Grobner Bases 

In this section, we shall develop an algorithm to compute a Grobner basis 
of an ideal in a polynomial ring over an ambient, strongly computable ring. 
First we need one key ingredient: head reduction. 

Definition 3.3.1 (Head Reductions) Let S be a ring, let R — S[xi, 
. . ., x n ], let > be a fixed but arbitrary admissible ordering on PP(xi, . . ., 

A 

x n ), and let G = {gi, g%, . . ., g m } C R be a finite set of polynomials. We 
say / <G R is head-reducible modulo G if / ^ and Hmono(/) e Head(G). 
If / is head-reducible modulo G and, specifically, 

m 

Hmono(/) = ^ aiPi Hmono(ft), where a% £ S,Pi £ PP(ii, . . . , x n ) 
and j3jHtcrm((7j) = Htcrm(J), then the polynomial 

m 

h = f - ^2 GlPi 9* 

i=l 

is said to be a head-reduct of / modulo G, and is denoted by 

!• G.h , 

/ — >h. 

We also write — for the reflexive and transitive closure of — I; that is, 

, G,h , 

f — >h, 

if there is a finite sequence hi, hi, . . ., h n (n > 1) such that hi — /, h n = h, 
and 

hi — -> hi + i, for i = 1, . . . ,n — 1. 

If / is not head-reducible modulo G, or if / = 0, we use the (perhaps, 
unfortunate) notation 



Definition 3.3.2 (Head-Normal Forms) We say h is a normal form of 
f modulo G under head-reduction (briefly, head-normal form or simply, 
normal form if there is no confusion) if 

G.h , G.h . 

/ — >h—^h. 

We write NFq(/) for the set of head- normal forms of / modulo G. □ 
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Theorem 3.3.1 If S is a ring, R — S[x\, . . ., x n ] and G — {gi, gi, ■ ■ ., 
g m } C R, then 

(V / g R) [nF&(/) ± 0" . 

PROOF. 

We proceed by well-founded induction on Hterm(/) with respect to the 
well-ordering > . 

A 

Suppose / = 0. Then Hmono(/) = and 

NF^(/) = {0} ± 0. 

Henceforth, assume that / ^ 0, and thus Hmono(/) ^ 0. To handle the 
base case, note that if Hterm(/) = 1, then either Hmono(/) ^ Head(G), in 
which case / is already head-reduced modulo G and 

NF£(/) = {/} 0; 

or Hmono(/) e Head(G), in which case / head- reduces to modulo G and 

NF£(/) = {0} 0. 

To handle the inductive case (Hterm(/) > 1), we assume by the induc- 

A 

tive hypothesis that for all h £ R 

Hterm(fc) < Hterm(/) => NF^(fe) ^ 0. 

A 

As before, either Hmono(/) ^ Head(G), in which case / is already head- 
reduced modulo G and 

NF^(/) = {/} 0; 
or Hmono(/) € Head(G), in which case / head-reduces to h modulo G and 
Hterm(/i) < Hterm(/) and NF&(/) D NF^(ft) ^ 0. □ 

A 

Theorem 3.3.2 Let S be a ring, R = S[xi, . . ., x n ], I be an ideal in R, 
and let G = {gi, gi, . . ., g m } C I be a finite subset. Then the following 
three statements are equivalent: 

1. Hcad(G) = Head(J). 

2. Every f £ I head-reduces to modulo G: 

(v/e/) [/ 

3. (G) = / 



(vfcg) (yhe SP(F)^j 
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PROOF. 

[(1) =* (2)] 

We proceed by induction on Hmono(/) with respect to the well-ordering 
> . 

A 

If / = then we are done. Thus we may assume that / ^ and / e /, 
and that, by the inductive hypothesis, for all h e I 

Hterm(/i) < Hterm(/) => h -^0. 

A * 

Since / e I, Hmono(/) € Head(J) = Head(G) and / head-reduces to some 
h e I, i.e., 

f^h, Hterm(/i) <Hterm(/), and he I. 

A 

If ft. = then we are done; otherwise, by the inductive hypothesis, 

. G,h , G,h _ 

/ — ► h — > 0. 

* 

[(2) =►(!)] 

Note that if / — -> then / can be expressed as 



/ = X] ^ where /, e S[xi,...,iJ, and 

Hterm(/) > Hterm(/j) Hterm^), i = 1, . . . , m. 

A 

Thus, condition (2) implies that G is a Grobner basis and Hcad(G) = 
Head(J). 

[(1) => (3)] 

Since G C I and Head(G) = Head(I), G generates I. Furthermore, every 
S'-polynomial h e SP(F) (F C G) is an element of I, and thus 



(vfcg) (whe SP(F) 



[(3) => (1)] 

Note that the condition, 



(vfcg) (yhe SP(F)^) 



simply implies that G satisfies the syzygy condition. 

Since we also assume that (G) = I, by Theorem 2.5.2, we conclude that 
Head(G) = Head(J). □ 
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3.3.1 Algorithm to Compute Head Reduction 

In this subsection, we present an algorithm that computes the head reduc- 
tion and then, using this algorithm, we develop an algorithm for Grobner 
basis. 

OneHeadReduction(/, G) 

Input: / G R; G C R, G = finite. 

Output: h such that / h. 



if / = or Hcoef(/) £ (JHcoef( 3 ): # G G f }J then 
return /; 

else 

Let Hcoef(/) = a; • Hcoef(gi), a; G S; 

ai eG f 

x - Hterm( f ) 
return / - \ a; • — -M- • gv, 

end{if }; 

end{ONEHEADREDUCTION} □ 

Using the above routine, we can compute the head reduction of a given 
polynomial /. 

HeadReduction(/, G) 

Input: / G R; G C R, G = finite. 

Output: h such that / < ^ J -> h ft. 
ft : /; ft' : /; 



loop 

ft := ti- 
ll' := OneHeadReduction^, G); 
until ft — ft'; 

return ft; 

end{HEADREDUCTION} □ 



Correctness and Termination: 

The correctness of the algorithm HeadReduction follows directly 
from the definition of the head reduction. 

In order to prove the termination properties, notice that the until loop 
satisfies the following loop-invariant at the end of the loop 

Hterm(/i) > Htcrm(ft'), i.e., 

A 

h^ti => Hterm(ft) > Hterm(/i'). 
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Thus, the head terms of h! are monotonically decreasing, and since > 

A 

is a well-ordering, we cannot have an infinite sequence of h"s. Thus, the 
algorithm HeadReduction must eventually terminate. 

3.3.2 Algorithm to Compute Grobner Bases 

Now we are in position to give an algorithm that computes the Grobner 
basis for a finitely generated ideal. 

First, we present a routine which checks if a given basis G is a Grobner 
basis for the ideal generated by G and it returns an 5-polynomial of G if 
G is not a Grobner basis for (G). 

GrobnerP(G) 

Input: G C R, G = finite. 
Output: S*(G) = |J NF%(SP{F)), 

— head-normal forms of all the 5-polynomials. 

S(G) = {0}, if G is Grobner basis of (G); 
S(G) / {0}, if G is not a Grobner basis of (G). 

S := 0; 

for every nonempty subset FCG loop 

Compute S' = the S-polynomials of F; 

for every b! e S' loop 

S := S U {HeadReduction (ft', G)} 

end{loop }; 
end{loop }; 

return S; 
end{GROBNERP} □ 



Correctness and Termination: 

The correctness of the algorithm GrobnerP is a simple consequence 
of the characterization of Grobner bases in terms of the syzygy condition: 

G = Grobner basis for (G) 



if and only if 
(VFCG,F^) (v h F e SP(F)^j h F 



<S(j 
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Termination, on the other hand, is a simple consequence of the following 
obvious conditions: 

1. Finiteness of the set of S'-polynomials SP(F) of F. 

2. Termination of the head reduction. 

Now, we give the main algorithm which computes a Grobner basis for 
an ideal generated by a given finite basis H: 



GROBNER(ff) 

Input: H C R, H = finite. 

Output: G C R, G = Grobner basis for (H). 

G := H; 

S := GrobnerP(G); 

while S / {0} loop 

G := G U S; 
S := GrobnerP(G); 
end{loop }; 

return G; 
end {Grobner} □ 



Correctness and Termination: 

The correctness of the algorithm Grobner follows from the following 
two facts: 

1. Let Gi be the value of G while the loop is executed the i th time. 



The algorithm maintains the following loop invariant: 



• (Gi) C (H). Since, S C (G,_i) = (H), we have 

G l - G,_! U S C (H) => (d) C (H). 

• Conversely, since _ff C G„ we have (i?) C (Gi). 



Thus 



= G C Gi C G 2 C • • • . 




Section 3.3 



Head Reductions and Grobner Bases 



87 



2. At termination, the syzygy condition is satisfied: 

(v F C G, F £ 0) (V h F e SP(F)) ^ 

if and only if 
S = GrobnerP(G) = 0. 

Next we show that, as a result of the Noetherianness of R, the termi- 
nating condition "S = {0}" must eventually be satisfied: 



S = Gr6bnerP(G 1 ) ^ {0} 
(3 h e S 



Hmono(/i) ^ Head(Gj) 
=> G i+1 =G,US and Hcad(G 4 ) £ Hcad(G m ) 

But, in i?, it is impossible to obtain a strictly ascending chain of ideals; 
that is, eventually, 



(?') 



S = Gr6bnerP(G £ ) = {0} 



Note that the algorithm given above is a very high-level description. 
It does not say anything about how to compute it efficiently. There are 
several issues involved here; for example, how to represent a multivariate 
polynomial so that all of the operations can be done efficiently, how to 
find at in OneHeadReduction quickly, how to compute S'-polynomials 
efficiently, etc. 

The Grobner basis algorithm is usually presented somewhat differently. 
In order to keep the exposition simpler, we have essentially divided the 
algorithm into two routines: (1) a predicate that decides if a given basis is 
a Grobner basis (GrobnerP) and (2) a procedure that keeps enlarging a 
given set until it satisfies the preceding predicate (Grobner). We conclude 
this section with the presentation of the Grobner basis algorithm in its more 
widely used form: 
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GROBNER(i?) 

Input: H CR, H = finite. 

Output: G C R, G — Grobner basis for (H). 

G := H; 

while there is some nonempty subset FCG 

such that some S-polynomial h € SP(F) 
does not head-reduce to 

loop 

ti := HeadReduction(/i, G); 

Comment: By assumption ti / 0. 
G := GU {ti}; 
end{loop }; 
return G; 
end{ Grobner} □ 

3.4 Detachability Computation 

Let us now consider the detachability property of a polynomial ring R = 
S[xi, . . ., x n ], given that the underlying ring of coefficients 5* itself is 
strongly computable. In particular, we will explore the algorithmic con- 
structions for detachability, building up on our Grobner basis algorithm of 
the previous section. It is not hard to see that the existence of a finite and 
computable Grobner basis in R allows one to solve the problem of ideal 
membership as follows: 

IdealMembership(/, H) 

Input: H C R and a polynomial / £ R; H — finite. 
Output: true , if / G (H). 

To solve this problem, first compute G — the Grobner basis for H, and 
output True if HeadReduction (/, G) = 0; otherwise, output False. The 
correctness follows from the fact that / e (H) if and only if / e (G), i.e., 

if and only if / ^+ 0. 

The above algorithm can be easily modified to express the polynomial 
/ as a linear combination of the polynomials in the Grobner basis G. But 
in order to solve the detachability problem, we need to do a little more, i.e. 
express / as a linear combination of the elements of H. 
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The easiest way to do this is to precompute the expressions for each 
element of G as linear combinations of the elements of H, and then substi- 
tute these expressions in the equation for /, which expresses / as a linear 
combination of the polynomials of G. We show, in detail, how this can 
be accomplished with appropriate modifications of the algorithms: One- 
HeadReduction, HeadReduction and Grobner. 

3.4.1 Expressing with the Grobner Basis 

Given a finite set of generators H = {hi, . . ., hi} C R, and a (finite) 
Grobner basis G = {gi, . . ., g m }, of the ideal (H), there are two matrices: 

1. an / x m matrix X = {xij} in R lxm , and 

2. an / x m matrix Y = {yij} in R lxm 
such that 



and 



.91 


= xi t ihi + ■ ■ 


■ + xi i ihi 




9i 




' hi - 








> i.e., 




= X T 




g m 


= Xi tTn hi + • 


■ ■ + Xl tm hl > 




. 9m _ 






hi 


= yi.iPi + • 


• + yi, m g m 




' hi ' 




9i 










> i.e., 




= Y 






hi 


= yi,igi + ■ ■ 


■ + yu m g m , 




. h i . 




. 9m _ 





Thus, in order to solve the detachability problem for R, we need to 
solve the following problem: 

Input: H = {hi, . . ., hi} C R; H = finite. 
Output: G — {gi, . . ., g m }, a Grobner basis for (H); 

Matrix X = { Xi ,j} G R lxm , and 

Matrix Y = {y i}j } £ R lxm . 

Let us begin by modifying OneHeadReduction and HeadReduc- 
tion, such that the NewHeadReduction solves the following problem: 
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Input: G = {gi, . . ., g m } C R, and f € R. 

Output: h =HeadReduction(/, G), and express it as 

/ = 2/101 H h ymffm + /l. 



We proceed by making changes to the algorithm OneHeadReduction 
as follows: 

NewOneHeadReduction(/, G) 
Input: f e R; G C R, G = finite. 

Output: h such that / — I h, and 

2/i, y m such that / = t/i^i H h j/ m g m + ft- 



if / = or Hcocf(/) £ ^{Hcoef(g): jeG/jj then 



return ^t/i := 0, . . . , y m ■= 0, h := f 
else 

Let Hcoef(/) = a »j " Hcoef^), a fj £ S 

return ( t/i := 0, . . . , 



Hterm(/) _ Hterm(/) 

Vil atl ' Hterm( Sil ) ' ' ' ' ' V ^ : ~ ^ ' Hterm( ffi J ' 

Vm ■= 0, 



h:= f- J2 a b 



Hterm(/) 
Hterm( 5i ) 



9ij 



Comment: {ii, . . ., i k } C {1, . . ., m}. 

end{if }; 

end{NEWONEHEADREDUCTION} □ 

To see that the algorithm is correct, all we need to observe is the fol- 
lowing: 



Hterm(/) Hterm(/) 
1 = a -'Hterm( 5jl ) 3ll+ ^ + flifc 'Hterm( 5 5ifc+ ^ 

= Vn 9ii H r- y ifc ft fc + ft 

= 2/151 H r- 2/m 5m + ft- 

[Since all the j/j's except j/j 's are 0.] 
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Now we can modify the HeadReduction algorithm to keep track of the 
coefficient polynomials as it repeatedly "calls" NewOneHeadReduction. 
The correctness of the algorithm can be shown by an inductive argument 
over the number of iterations of the main loop. 

NewHeadReduction(/, G) 

Input: / G R; G C R, G = finite. 

Output: h such that / h h, and 

2/i, . .., ym such that / = y 1 g 1 H + y m g m + h. 

(yi,...,y m ,h) := (0,...,0,/>; 
(y' u ...,y' m ,ti) := (0,...,0,/); 

loop 

h := ti 

{yi, ■■■,y m ) ■= (yi, ■ ■ ■ ,2/m) + {y'i, ■ ■ -,y' m ); 

(y'i, . ■ ■ ,y' m , h') := NewOneHeadReduction(/i, G) 
until h — h'; 

return (y 1 , . . . ,y m , h); 

end{NEWHEADREDUCTION} □ 

Now we are ready to modify the algorithm Grobner (on page 88) in 
such a way that it also produces the X and Y matrices as by-products. 

The main idea is to incrementally compute the X matrix (expressing G 
in terms of H) as the computation of G progresses: Initially, we begin with 
G = H, and the X matrix is simply an identity matrix. At any point in the 
loop, as we add a new elment g to G, we know how to express the new g in 
terms of the elements of G computed so far. But, now, using the currently 
computed matrix, we can also express g in terms of H, and hence the new 
row of X. Again by an induction on the number of iterations of the main 
loop, the correctness of the computation of X can be demonstrated. 

The computation of the Y matrix is, in fact, relatively easy. Since at 
the termination G is a Grobner basis and since each element hi € H is in 
the ideal (G) = (H), hi^O, and the algorithm NewHeadReduction 

gives the i th row of Y: 

hi = Vi,i gi + 2 92 H h Vi,m 9m- 



NewGr6bner(_H") 

Input: H C R, H = finite. 

Output: G C R, G — Grobner basis for (H), and 
the matrices X and Y, relating G and H. 
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9i ■= hi; 

(£1,1,3:2,1, • • -xi,i) ■= (1,0, 
g-2 := hr, 

(£1,2, £2,2, • • -x l>2 ) ■= (0, 1, 
Qi ■= hi; 

{xi,i,x 2 ,i, ■ ■ ■ xi t i) := (0, 0, ... , 1); 

Comment: G = H and X = the identity matrix. 

k := I; 

while there is some nonempty subset F C G 

such that some 5-polynomial ft £ SP(F) 
does not head-reduce to 

loop 

Let ft = cugi H + a fc g fe ; 

(yi,...,y k ,ti) := NewHeadReduction(/i, {31, . . . ,g k }); 
Comment: Note that 

ti = h- yigi y k g k 

= (ai- 2/i)3i H 1- - Vk)g k 

= (01 — J/l)(xi,i/li H 

+ ■■■ 

+ - y k )(xi, k hi H h a;i,fcfti) 

= ((oi - J/i)»i,i H H (at - y k )xi,kjhi 

H 

+ ((ai - yi)aji,i H h (o fc - y k )xi^hi 

end{ Comment}; 
y fc+ i := ft'; 

(2:1, k+i, K2,fc+i, . . . xi lk +i) := 

((01 - j/i)a;i,i H h (o fc - y k )x Xtk , 

(01 - 2/1)12,1 H h (ok - y k )x 2 , k , 

• ' ' 5 

(01 - H h (o fc - y k )x iik ); 

G := GU {ft'}; 
fe := fc + 1; 
end{loop }; 

(yi, 1,2/1,2, • • • ,yi,m) := NEwHEADREDUCTiON(fti,G)[l..m]; 

(j/2,1,2/2,2, • • • ,y2,m) := NEWHEADREDUCTION(ft 2 ,G)[l..m]; 

{yi.i,yi,2, ■ ■ -,yi,m) ■= NewHeadReduction^, G)[l..m]; 
return (G,X,Y); 
end{NEwGROBNER} □ 
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3.4.2 Detachability 

Using the machinery developed in the previous subsection, we are now 
ready to solve the detachability problem for the polynomial ring, R = S[x\, 

. . . , Xn, • 



Detach(/, H) 

Input: H — {hi, . . . , hi} C R, H — finite, and an / G R. 
Output: Decide whether / € (H), 

If so, then return {xi, . . . , x{\ 

such that / = xih\ + ■ ■ ■ + xihi; 

Otherwise, return False. 

(G,X,Y) := NewGr6bner(#); 

Comment: G = Grobner basis for (H); 

the matrices X and Y relate G and H. 
end {Comment}; 

(j/i, . . . ,y m ,h) := NewHeadReduction(/, G); 
if h = then 

Comment: / € (G) = (H), and 

/ = ^151 H 1" 2/m3m 

= {yixi,i H + y m xi, m )hi 

+ ■■■ 

+ {yixi,i H h y m xi, m )hi 

end {Comment}; 

return {2/1X1,1 H h y m xi, m , . . . , yi^.i H h y m a;i,m}; 

else 

return false ; 
end{if }; 
end{DETACH} □ 

Proving the correctness of the algorithm is now a fairly simple matter. 

Note that / g (H) if and only if / e (G), i.e., if and only if /— UO. In 

this case, the algorithm NewHeadReduction determines the fact that / 
reduces to zero and that / can be expressed in terms of gi's as follows 

/ = 2/151 H ^Vm 9m- 

The rest of the algorithm simply re-expresses <7,'s in terms of hi's using the 
matrix X. 
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3.5 Syzygy Computation 

Now, we are ready to consider the syzygy solvability property of a polyno- 
mial ring R = S[xi, . . ., x n ], where we again assume that the underlying 
ring of coefficients S is strongly computable. As in the previous section, 
the algorithmic constructions for syzygy solvability will rely on our Grobner 
basis algorithm developed earlier. In order to keep our exposition simple, 
we shall proceed in two stages: First, we will solve the problem for a spe- 
cial case, where we compute the syzygies of a set which is also a Grobner 
basis of its ideal. Next, we deal with the general case, where the set is any 
arbitrary finite subset of R. 

3.5.1 Syzygy of a Grobner Basis: Special Case 

We start with the following simple case: Compute the syzygies of the set 
of polynomials 

G = {gi, ...,g q }c R, 
where G is a Grobner basis for (G) (under some fixed admissible ordering) . 



Input: G = {gi, . . ., g q } C R; G = finite. 
G = a Grobner basis for (G). 

Output: A finite basis, {ti, . . ., t^} for the i?-module S(G) C R q 
Let T denote the following p x q matrix over R: 



where t,,,- £ R 







' ti.i • 


■ *1,9 


tp 




t p ,i ■ 





such that 

1. For all 1 < i < p, 



X! tl ,i 9j = o. 



2. For any (ui, . . ., u q ) = u G R q 

3 

(3v = (v u ...,v p }eR p ) 



u = vih-\ \-v p t v 
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Let F = {<7i! , . . ., gi k } C G be a nonempty subset of G. Let 
s = ( s ii > ■ • ■ ! s i k ) € S 1 

be a tuple in the syzygy basis for {Hcoef^i-J, . . ., Hcoef(gj fc )} C In T, 
we have an entry, t, for each such F and s as explained below: 

• Let h be the S'-polynomial corresponding to the subset F and the 
tuple s. That is, 

to ui 
h = Sl1 ' Hterm( 5n ) " 9h + ' ' ' + Si " ' Hterm( ftfc ) ' 9l » ' 

where m = LCM(Hterm((; il ), . . . , Hterm(g ifc )). 

• Since G is a Grbbner basis, we have 

and h can be expressed as follows (since R is detachable): 

h = fl ■ gi H h /g • ffg, 

where to > Hterm(/i) > hterm ( /, ) Hterm((/ i ), for all i. 

A A 

• Let t, now, be given by the following g-tuple (in R q ). 

t = ( -fi, 

, m _ , _ , 

Hterm(fifiJ 

. . . , 

Ji k -l, S ik ■ Hterm(5 .j /i*. /<* + !. 
• • • i — /g 

• From the discussions in the previous section, we know that all of 
the above steps are computable, since, by assumption, S is strongly 
computable. 
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The correctness of the above procedure is a direct consequence of the 
following observations: 



1. Each tj (1 < j < p) satisfies the following condition: 

tj,i ■ 9i H + tj yQ ■ g q 

m m 

= Stl ' Hterm( ffil ) ' 9il + ' ' ' + 8i " ' HtcrmfoJ ' 9lk 

~fl ■ 91 fq ■ 9q 

= h-h = Q. 
Thus tj is in the i?-module of the syzygies of G. 

2. Let u = (ui, . . . , u q ) be an arbitrary element of the i?-module S(G). 
We need to show u can be expressed as a linear combination of tj's, 
i.e., that there are v\, . . ., v p £ R such that 

u = vi h H 1- v P V 

Given a u, we define as its height, the power-product M, where 
M = rnax |Hterm(uj) Hterm(gi) : 1 < i < gr|. 



Now, assume to the contrary, i.e., there exists a u £ R q so that we 
obtain the following: 

(a) ue 5(G). 

(b) u<£Rtl-\ h R T v . 

(c) Every element of R q satisfying the above two conditions has a 
height no smaller than that of u. The existence of u is guaranteed 
by our hypothesis and the well-foundedncss of the admissible 
ordering < . 



We shall derive a contradiction! 
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• M = the height of u. 

L = . . . , ik} C {1, . . . , q} such that 

(1) j e L => Hterm(uj) Hterm(fiij) = M. 

(2) j £ L => Hterm(Mj) Hterm(^) < M. 

A 

• Since Ui ■ gi H + u q ■ g q = 0, we see that 

Hcoef^UjJ • Hcoef^jJ + • • • + Hcoef(wi fc ) • Hcoef(<7i fc ) = 0. 

Let 

Sl,l ■ • ■ Si k 



S = 



Sl 




. si . 





Sl,k 



where Sij e S 



be a basis for the S'-modulc, 5 , ^Hcocf(_g il ), . . . , Hcocf(g ijc fj . 
Hence there exist v[, . . ., v\ £ S such that 

(Hcoef^jJ, . . . ,Hcoef(ujJ) = v[ ■ sIH h v[ ■ sj. 

Let m = LCM ^Hterm^jJ, . . . , Hterm(gi fc )^ , and Q = — . Thus 
m-Q = M = Hterm(uj 1 ) Hterm^jJ = • • • = Hterm(ui fc ) Hterm(<7i fc ' 
The following key observation is needed in the rest of the proof: 



Hmono(M il ) • g ir 



Hmono(w ifc ) • g it 



+ ■ 



+ 



v'i-Q 



VHtcrm^iJ 

/ SL k m 
VHterm( ffl J 



+ --- + v' r Q 



■ Q ■ ( - 



s/,i m 



\mcrm(g h ) 



v > . q . ( S -Lh m 

1 V Hterm (5iJ 



9h 



9r k 



Let t[, . . ., t\ be the elements of the basis T for S(G), each corre- 
sponding to an S'-polynomial for a subset F — {g^, . . ., gi k } and an 
element of {sT, . . ., s7}. 

Let 



u' = (u\, . . . , v!) = u-v[-Q-t' 1 



v'rQ- t' r 



<)<S 
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By the construction, we see the following: 

1. 7e S(G). 

2. u' $ R t\ + ■ ■ ■ + R t p , since, otherwise, we can also find v\, . . ., 



v p £ R such that 



u = vi ■ t\ H h v p ■ t p . 



3. If we now show that the height of u' is smaller than M under the 
admissible ordering < , then we have the desired contradiction. 

A 

Note that the expression u[ g\ H + u' q g q is equal to 

uiffi H h Tai^Wij )g il H 

+ TailKJSu. H H u q g q 

+ Hmono(w» 1 )gi 1 H 

+ Rmono(ui k )g Zk H 



-vi 



-v[Q 



-Zi'iSi 1- sj,i 



Hterm((/i 1 ) 



Si.fc: 



/i'ii j H 



"Hterm(giJ 

Thus we see that for all j (1 < j < q) 

( T a i\{ Uj )+v[Q /{,, + ••• + v[Q fl }j , ifje{ti,...,t fc } 
u j = S 

I u,- +v[Q fij + ■■■ + v[ Q fl tj , if j $ {ii,. ..,t fc } 

In both cases, Hterm(u^) Hterm^ ) < M, and u' has a height smaller 

than that of u, contrary to our hypothesis about the minimality of 
the height of the selected u. Thus 

5(G) =Rh + --- + Rtp~ 

and T is a basis for the syzygies of G. 
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Furthermore, our arguments also provide an algorithm that decides if 
a given u € R q is in the syzygy of a set G and, additionally, if possible 
(i.e., if u € S{G)), expresses u as a linear combination of the syzygy basis 
computed earlier. 



Input: u = (tii, . . . , u q ) € R q , 

G is a finite Grobner basis for (G), 

T = {ti, . . . ,t p } is a basis for S(G). 
Decide: Whether u £ S(G). If so, then compute the coefficients 

vi, . . ., v p € R such that u = vi t\ H + v p t p . 



if tti gi + ■ • • + u q g q =fc then 
return with failure; 



w := u\ 

{vi,...,v p ) := (0, ...,0); 
while w ^ (0, . . . , 0) loop 



L := |i a : Utena(w ia g ia ) = height (w) j; 
Comment: L = {ii , . . . , i k }; 

q . = height (w) 

W ' LCM(Hterm( 5ll ), . . . , Hterm( fli J) ' 
Compute sT, . . . , ~sj, a basis for 

the syzygies of {HcoeffcjJ, . . . , Hcoe% i)s )}; 



if := ^jj, : tj b — basis element in T for F and s 6 j-; 
Comment: if = {ji, . . . , j;}; 

Compute , . . . , £ S such that 

(Hcoe^WjJ, . . . , Hcoef(tii fe )) =v jl sTA + v n sj; 

(vi,...,v p ) := (v u . . . ,v p ) + (0, . . . ,v jl , . . . ,v n , . . . ,0); 



w :— w — Q 



v i\ tji + ■ ■ ■ + Vj l tj l 



end{loop }; 
return (vi, . . . , v p ); LI 



3.5.2 Syzygy of a Set: General Case 

Now, we are ready to consider the syzygy computation problem in the most 
general setting. 

Input: H = {hi, . . ., hi} C R; H — finite. 

Output: A finite basis {wT, . . ., IZv} for the i?-module S(H) C R l . 
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Let W denote the following r x I matrix over R: 





Wi 








w = 






W ri i . . . W r J 


, where Wij 


such that for all 1 < i 


<r, 












E Wi j h ■ = 

3 




and if 












E 


Uj h 


j = 0, where Ui, 


..,uiER 



then 



u = (ui,...,u t ) 

= v\ wi + ■ ■ ■ + v r uv , where v\ , . . . , v r G R. 

• Let G = {gi, . . ., g m } — a Grobner basis for (H). 

• Let T be given by 



T = 



' tl ' 








tp 




_ tp.l 





, where tjj € i? 



be a basis for the R- module S{G). This can be computed using the 
algorithm of the previous subsection. 

Let Y be an / x m matrix, Y = {yij} € R lxm 

2/1,1 • • ■ yi,m 



Y = 



via 



yi,r, 



such that 



hi 


= 2/1,151 + ' 


■ ■ + V\,m9m 


> i.e., 




= Y 


9i 




= yi.igi + ■ 


■ + yi, m g m J 




. h i . 




. 9m _ 



and let X be an I x m matrix, X = {xij} G i2* 



2^,1 
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such that 

91 = ari,ifti H \-xi,ihi 

9m = xi,mh\ H \-xi >m hi 

Both X and Y can be computed by the modified Grobner basis al- 
gorithm of the previous section. 

We now claim that W is given by the following ! x (I + p) matrix: 

Wl 
Wl 

W+i 
w i+p . 

Next we show that WT, . . ., Wi, indeed, form a basis for the i?-module 
S(H). 

1. Consider first Wi (1 < i < I) 

Wi,i hi-\ h w iti hi H h w it i hi 

= - (Vi,l Xl,l H h yi, m Xl tm )hl 

+ ... 

+ hi- (Vis X itl H h Vi,m Xi, m )hi 

+ ... 

- (Vi,i xis H h y itTn xi >m )hi 

= hi-y^igi y it m 9m 

= hi-hi = 0. 

2. Next consider wi(l + l<i<l+p) 

Wi,i h\-\ h Wij hi^ h wu hi 

= {U,l Xl t \ H + U, m Xl, m )hl 

+ ■■■ 

+ (U,i xis H 1- k tTn Xi. m )hi 

= U,l 91 + • • • + U,m 9m 
= 0. 



> i.e., 



9i 



X 1 



hi 
hi 



W = 



h - YX 1 
TX T 
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Thus S(H) DRW+--- + R M+t- 
Conversely, let u = (m, . . ., ui) e S(H), i.e., 

mi hi H h ui hi = 0. 

Let 

ui = "1 H \-uiyi tl 

Ihn = Hi i/l,m H \-Utyi im . 

Thus 

mi 5i H Vu m g. m = u 1 h 1 ^ \-u t hi =0, 

and there exist v[, . . ., v' p £ R such that 

(ST, ...,u^) =v[ *H Vv' v T v . 

We show that 

u = (ui,...,ui) 

= Ui Wi H h u/ ul; + v[ wi+i H h v' p wj^p~. 

Consider the j th component of the expression on the right-hand side: 

u\ wi,j H + ui wij + v[ H + v' p wi +p j 

= - iti(yi,i Xj,i H + j/i, m Xj, m ) 

+ '«j - Xj,i H h yj, m Xj, m ) 

- a^',1 H + 2/;, m a; jjm ) 

+ Xj,l H + tl,m £j,m) 

+ ••• 

-|- Vp{tp^\ H~ * * * ~h tp.m 3sj,rn) 

= Uj 

+ 2/1,1 itf j/i,i + u i ti,i H + * P ,i)a;j,i 

+ ••■ 

+ (— Wl 7/l, m — J/; iIn + ti, m H + D p t p ,m)Xj,m 

= Uj + {-Ul+Ul)Xj,l H + ( — + utn)Xj,m, 

= Uj. 
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Thus 

S(H) CRWT+--- + R Wi^ C S{H), 

and w\, . . ., wi+p is a basis for S(H). 

Using the arguments developed here, we can generalize the algorithm of 
the previous subsection. That is, we devise an algorithm which decides if a 
given u € R l is in the syzygy of an arbitrary subset H C R and, addition- 
ally, when possible (i.e., u e S(H)), expresses u as a linear combination of 
the syzygy basis W of H. 

Input: u — (iii, • • • ,ui) € R l , 

H = {h u . . . , hi} C R, H = finite. 

W — {wT, ■ ■ . , w i+p } is a basis for S(H), 

(as in this section.) 
Decide: Whether u £ S(H). If so, then compute the coefficients 

vi, . . ., v t+p £ R such that u = v\ wl + ■ ■ ■ + vi +p wi +p 

if uihi + ■ ■ ■ + uihi / then 
return with failure; 

Let G, T, X and Y be as defined earlier; 

Let Ux = Y!j=l U 3 Vj,l> ihn = Yl'j=l U 3 Vi,™, 

Compute v[ , . . . , v' p € R such that 

(wT, • • • , u^n) =v[ti-\ + v' p t p ; 

Comment: (ui, . . . , u^n) G S(G). 

return ...,«(,«!,... ,Vp). □ 

3.6 Hilbert's Basis Theorem: Revisited 

We can now summarize the discussions of the earlier sections to provide a 
stronger version of the classical Hilbert's basis theorem. 

Recall that earlier we defined a ring to be strongly computable if it is 
Noetherian, computable (i.e., it has algorithms for the ring operations), 
detachable (i.e., it has algorithms for the ideal membership problem) and 
syzygy- solvable (i.e., it has algorithms to compute a basis for the module 
of syzygies). We want to show that a polynomial ring over a strongly 
computable ring is itself a strongly computable. 

Theorem 3.6.1 If S is a strongly computable ring, then the polynomial 
ring R = S[x\, x 2 , ■ ■ ■, x n ] over S in n variables is also a strongly com- 
putable ring. 
PROOF. 

Our arguments depend on the existence of an algorithm to compute a 
Grobner basis for an ideal in R, with respect to some fixed but arbitrary 



104 



Computational Ideal Theory Chapter 3 



admissible ordering. In the previous sections we have seen that if the 
admissible ordering is computable, then this is possible, since S is strongly 
computable. Assume that the admissible ordering of choice is computable, 
e.g., purely lexicographic ordering. 

1. R is Noe.the.rian by Hilbcrt's basis theorem. 

2. R is computable, since it is straightforward to develop algorithms to 
(additively) invert a polynomial as well as to multiply and add two 
polynomials; these algorithms are based on the algorithms for the 
ring operations in S. 

3. R is detachable. See Section 3.4 on dctachability computation. 

4. R is syzygy-solvable. See the Section 3.5 on syzygy computation. □ 

3.7 Applications of Grobner Bases 
Algorithms 

In this section, we consider algorithms for various operations on ideals in 
a strongly computable ring, R. Thus we assume that all the ideals in this 
ring are presented in a Unitary way, i.e., by their finite bases. 

We have already seen how the ideal membership problem can be solved 
using Grobner basis. It is quite easy to devise algorithms for ideal congru- 
ence, subideal and ideal equality problems, simply building on the mem- 
bership algorithm. 

The operations sum and product are also trivial; the operations inter- 
section and quotient are somewhat involved and require the algorithm to 
find a syzygy basis. The operation radical requires several concepts, not 
discussed so far. 

3.7.1 Membership 

IdealMembership(/, H) 

Input: H C R and a polynomial / £ R, H = finite. 
Output: True, if / G (H). 

1. Compute G, the Grobner basis for H. 

2. Output True, if HeadReduction(/, G) = 0; otherwise False. 
The correctness follows from the fact that / e (H) iff / e (G) iff / — U 0. 
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3.7.2 Congruence, Subideal and Ideal Equality 
Ideal Congruence 

We define, for any ideal I C R, an equivalence relation = mod/ (congru- 
ence modulo the ideal I) over R, as follows: 

(v/,.gei?) [f = g mod I iff f-gel 



IdealCongruence(/, g, H) 

Input: H C R and polynomials f,g£R; H, finite. 
Output: True, if / = g mod (H). 

Output True, if / — g e (H) (using membership algorithm); otherwise 
return False. The correctness of the algorithm follows directly from the 
definition of congruence. 

Subideal 

Subideal(.Hi, H2) 

Input: Hi, Hi C R; Hi and H2, finite. 
Output: True, if (Hi) C (H 2 ). 

Output True, if hi G Hij hi e (_ff 2 ) (use membership algo- 
rithm); otherwise return False. 

Ideal Equality 

IdealEquality(.Hi , H2) 

Input: Hi, Hi C R; Hi and H2, finite. 

Output: True, if (Hi) = (H 2 ). 

Output True, if (Hi) C (H 2 ) and (if 2 ) C (i/x); otherwise output 
False. 

3.7.3 Sum and Product 

Ideal Sum 

IdealSum(J?i, #2) 

Input: i/i = {hi, . . ., h r }, H2 = {h r +i, . . ., h s } C R; Hi and H2, finite. 



Output: A finite basis H for the ideal (Hi) + (H 2 ). 
Output simply H = j/ii, . . . , h r , h r +i, . . . , /i s |. 
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Ideal Product 

IdealProduct(J?i, H 2 ) 

Input: Hi = {hi, . . ., h r }, H2 = {h r +i, ■ ■ ., h s } C R- Hi and H2, finite. 
Output: A finite basis H for the ideal (Hi) ■ (H2). 



3.7.4 Intersection 

lDEALlNTERSECTION(i?i, H2) 

Input: Hi — {hi, . . ., h r }, H2 = {h r +i, ■ ■ ., h s } C R; Hi and H2, finite. 
Output: A finite basis for the ideal (Hi) n (^2)- 
The main idea is to solve the following linear equation 



Output simply H 




u\ hi + ■ ■ ■ + u r h r = u r+ i h r+ i + ■ ■ ■ + u 3 h. 



(3.1) 



then the ideal (Hi) n (H 2 ) is equal to the following ideal 





. . . , u s J (tii, • • • , u s ) is a solution of the equation 3.1 





Let W be a syzygy basis for {hi, . . ., h r , h r+ i, . . ., h s }: 



Wl,l 



Wl, r Wl,r+1 



Wl, 



w = 



W p ,l 



p s 



We claim that 




• • + toi )f . h r ), . . . , (w Pi i hi + h w p .r h r ) 



Proof of the Claim: 



(=>) For^all 1 < i < p: 

hi = w it i hi H h Wi >r h r e (Hi) 

and 



Section 3.7 Applications of Grobner Bases Algorithms 



107 



hi = -«V+i h r +i Wi iS h s G (H 2 ), 

and for all i, hi G (iii) n (ii 2 )- 

(•$=) Conversely, assume that h G (i?i) n (i? 2 )- Then 

ft. = Ui hi + ■ ■ ■ + u r h r , (since h G {Hi)) 
= u r+ i h r+ i H + u s h s , (since h G {H 2 )) 

Thus 

ui hi + • • • + u r h r — u r+ i h r+ \ — ■ ■ ■ — u s h s = 0, 

and 

u= (ui,...,u r , —u r+ i, . . . , —u s ) G i? wT + hi? Wp~- 

That is, there exist v\, . . ., v p G i? such that 



Thus 



u = ui wi + • • • + Up w p . 

h = ui hi + ■ ■ ■ + u r h r 

= (vi wia H h v p w Pt ij hi 

+ ■■■ 

+ (vi W\ <r H h Vp w Pt ^j h r 

= vi {wi.i hi H h wi. r h^j 

+ ■■■ 

+ v p (w Pt i hi H h Wp. r h r ^j 

= Vi hi H Vvphp 

G p,...,/^). □ 



3.7.5 Quotient 
Ideal Quotient, {H) : (/) 

IdealQuotient(/, H) 

Input: f £ R, H = {hi, . . ., /i r } C R; H, finite. 
Output: A finite basis for the ideal (H) : (/). 
As before, the main idea is to solve the following linear equation 

Ui hi + ■ ■ ■ + u r h r = u r+ i /. (3.2) 
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Then the ideal (H) : (/) is equal to the following ideal 



|u r+ i: (3 ui, ...,u r ^j 



(til, ... , u r , u r+ i) is a solution of the equation 3.2 
Let W be a syzygy basis for {/ii, . . ., h r , /}: 
W = 

We claim that 

Proof of the Claim: 
(=>) For all 1 < i < p: 

m,r+i f = -Wi t i hi- w hr h r G (H). 



Wi 




101,1 • • 


• I0l, r 


Wl,r+1 






. W P,1 ' ' 


Wp^ r 


W p .r+1 


(Wl,r+1, • • 


■ , Wp, r +l) 


= (H): 


(/)• 



Hence w^ r +i € (-ff) : (/)■ 

(<*=) Conversely, assume that u e : (/), i.e., u ■ f e Then 



u ■ f = u\ h\ + ■ ■ ■ + u r h r 



and 



tt = (ui, . . . ,u r , —u) £ Rwi + ■■ ■ + R w p . 
That is, there exist v\, . . ., v p £ R such that 

u = vi wi + ■ ■ ■ + Vp Wp~. 

Thus 

U = -Vl ffil, r+ l Vp Wp ir+ 1 

e (wi, r+1 ,...,w p>r+ i). □ 



Ideal Quotient, (#1) : (H 2 ) 

lDEALQuOTIENT(i/i, H 2 ) 

Input: Hi = {hi, ■ . ., h r }, H2 = {h r +i, . . ., h s } C R; Hi and #2, finite. 



Output: A finite basis for the ideal (Hi) : (#2). 



Problems 
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Note that 

(Hi) : (H 2 ) = 1(11,) : £ (hj)\ = f) ((^) : (hj)). 

y j=r+l y j=r+l 

One direction follows from the arguments below: 

u G (Hi) : (ffe) « (#2) C (Hi) 

(V r + 1 < j < s) [u • ftj e (#1) 

s 



j=r+l 



Conversely, 

s 

j=r+l 

(Vr + l<j<sj u-hjGiHi) 

(V /l, ...,fr£R) [u (/l ftl + • •• + f r kr) € (#l) 

u^CiH,) 
=> u e (H!) : (H 2 ). 



Hence (£/i) : (#2) can be computed using the algorithm for computing 
the quotient, (H) : (/), and the algorithm for computing the intersection 
of two ideals. 



Problems 

Problem 3.1 

Let /1, f 2 , . . ., f s € K[x] be a set of univariate polynomials with coef- 
ficients in a field K. Show that the ideal generated by /j's has a Grobner 
basis with a single element (i.e., K[x] is a principal ideal domain). You 
may not use any fact other than the properties of the Grobner basis. 

In the case s = 2, what is the relation between Buchberger's algorithm 
to compute a Grobner basis and Euclid's algorithm to compute the g.c.d. 
of two univariate polynomials? 



Problem 3.2 

Prove the following: 
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(i) Let G C I be a Grobner basis of an ideal I of R = 5[a;i, . . ., x n ]. Is 
it true that for every / e R, |NF^(/)| = 1? Is it true that for every f el, 
|NF^(/)| = 1? 

Hint: Consider a Grobner basis {x 2 + l,y 3 + 1} for the ideal (x 2 + 
1, y 3 + 1) of Q[^, r>] (with respect to < )■ Can a polynomial, say x 6 y 6 + 

TLEX 

xy 2 + x + 1, have more than one normal forms (under the head-reduction) 
with respect to the given Grobner basis? 

(ii) A Grobner basis G m ; n for an ideal J is a minimal Grobner basis for 
/, if no proper subset of G m i n is a Grobner basis for /. 

A Grobner basis G s h r for an ideal / is a self- head- reduced Grobner basis 
for /, if every nonzero g € G s h r is head-reduced modulo G s h r \ {<?}• 

Let G be a Grobner basis for an ideal I of R such that ^ G. Show that 
G is a minimal Grobner basis for / if and only if G is a self-head-reduced 
Grobner basis for /. 



Problem 3.3 

A basis F for an ideal / is said to be a minimal basis for /, if no proper 
subset of F is also a basis for /. 

(i) Let F be a self- head-reduced Grobner basis such that \F\ < 2. Show 
that F is a minimal basis for (F). 

(ii) Consider the following basis F for the ideal (F) C rv]: 

F = {x 2 — y,xy — 1, y 2 — x}, 

Show that F is a self-head-reduced, minimal Grobner basis for (F) under 
any admissible total-degree ordering but not a minimal basis for (F). 
Hint: (x 2 -y,xy- 1, y 2 - x) = {xy - 1, y 2 - x). 



Problem 3.4 

Let S be a ring, let R = S[x\, . . ., x n ], let > be the fixed admissible 

lex 

ordering of choice on PP(xi, . . ., x n ). 

Let G = {gi, gi, . . ., g s } C i? be a finite set of polynomials, and f £ R 
an arbitrary polynomial such that 

G,h , G,h . G,/i G,h G,h 
J * Jl * 31 * ' ' ' * frn * Jm- 

Show that, if the d and D, respectively, bound the degrees (in each 
variable) of the polynomials in G and of the polynomial /, then 

m < (Jl + I y {d+1) n(n+l)/2 

Hint: If it — x^x^ 2 • • -x% n is an arbitrary power product, then we 
assign it a weight as follows: 

WgW = <*i(d + I)"" 1 + a 2 (d + I)"" 1 + • • • + a n (d + 1)°. 
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Let the weight of a multivariate polynomial be denned to be the biggest of 
the weights of its power products; that is, 

if / = aiwi + a 2 7r 2 H h a^i, then Wg(/) = maxWcft,). 

i 

The rest follows from the following two observations: 

1. Waifm) < Wcifm-x) <■< W G (/i) < W G {f). 

2. The Hmono(/i)'s are all distinct. 

Problem 3.5 

Let S = Noetherian, computable and syzygy-solvable ring. We say S 
is 1-detachable if, given {/i, . . ., f r } C S, there is an algorithm to decide 
whether 1 e (/i, . . ., f r ), and if so, to express 1 as 

1 = hi ■ fi + ■ ■ ■ + h r ■ f r , hi, . . . ,h r € S. 

Show that S is 1-detachable if and only if S is detachable. 

Problem 3.6 

Let S = Noetherian, computable and detachable ring. 

• We say S is intersection-solvable if, given Fi = {fi, f r } and 
F2 = {.fr+i, ■ ■ -j fs}, Fi, F 2 Q S, there is an algorithm to compute a 
finite basis for (Fi) n (F 2 ). 

• We say S is quotient-solvable if, given Fi — {fi, . . ., f r } and F 2 = 
{fr+i, ■ ■ •, f s }, Fi, F 2 Q S, there is an algorithm to compute a finite 
basis for (Fi) : (F 2 ). 

• We say S is annihilator- solvable if, given / g S, there is an algorithm 
to compute a finite basis for ann/. 

Show that the following three statements are equivalent: 

1. S is syzygy-solvable. 

2. S is quotient-solvable. 

3. S 1 is intersection-solvable and annihilator-solvable. 
Problem 3.7 

Let 5 be a Noetherian ring, such that, given a subset F C 5[t], there is 
an algorithm to compute a finite basis of the contraction of (F) to S, i.e., 



{F){S[t]}nS. 
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In this case, we say S is contraction-solvable. 

(i) Show that S is intersection-solvable. 

Hint: Show that if I\ and 7 2 are two ideals in S, then 

hni 2 = (th + (i-t)h){S[t]}nS. 

(ii) Show that if S is strongly computable, then S is contraction- 
solvable. 



Problem 3.8 

Consider an ideal M. C "L[r\^, x 2 , ■ . ., x n ] generated by a finite set 
M = {Mi, M 2 , . . ., M„} C Z[^ F , ar 2 , . . ., x n j. Let A e Z[^ F , x 2 , . . ., 
x n ] be a multivariate polynomial with integer coefficients, whose terms 
are ordered according to the lexicographic ordering, with the biggest term 
occurring first. 

If Hmono(Mi) divides Hmono(A) and 

A' = A * mono M Mj 



Hmono(Mi) 
Hmono(j4) 



Tail(Mi) +Tail(A), 



Hmono(Mj) 

then we say that Mj reduces A to A' and we denote this by the expression 

A^A'. 

Note that, as earlier, 

M,/i 



is the reflexive and transitive closure of A -^> (for some Mj <E M). 

A set of generators M = {Mi, M 2 , . . ., M v } of the ideal M. is an E-basis 
of the ideal if 

AAA A M > h n 

AeM A — UO. 

Let Mj and Mj be two distinct polynomials in the ideal A4. Then we 
define the S-polynomial of Mj and Mj (denoted, S(Mi, Mj)) as follows: 



m 

Hmono(Mj) "* % Hmono(M 7 )' 



where m = LCM{Hmono(Mj), Hmono(M 3 )}. 

For every nonempty subset M' = {M ix , . . ., M^} C M, we let 

q = gcd{Hcoef(M il ),...,Hcoef(M iM )} 
= aiHcoef (M h ) H h a M Hcoef(M v ) , 
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where q, 01, . . ., € Z, and we let, 

7r = LCM|Htcrm(M n ),...,Htcrm(M v )| 

whence, 

q ■ n = ai HteJ(M n ) Hm0n ° (M - l) + ' ' ' + ^ Ht CT rn r (Mv) Hm0n0(Mv) 
and clearly q ■ tt E Head(Mi, . . ., M„). Thus, for every such M' we define 

^ (M ' } = ^ HtermV,) ^ 1 + " " " + ^ HtermVu ^ " 
This leads us to define the \t expansion of M to be 

*(M) = *({Mi,...,M„}) 

= {Mi,...,M„} (J {V(M') :0£M' CM 

A (VI < i < i/) [Hmono(Mj) \ Hmono(^(M'))] } 

= {P 1 ,...,P X } = P, 

where we have removed duplicates or multiples with respect to the head 
monomials. 

Show that the following algorithm computes an _E-basis of an ideal 
M. C 7L\r\yt, X2, ■ ■ ., x n ] generated by a finite set M = {Mi, M 2 , . . ., M v }. 

_E-Basis Algorithm: 

Input: M C Z[.r^, . . ., x„\, 

M = finite. 
Output: P CZ[r^, . .., x„\, 

(P) = (M), and P satisfies the property (E). 

P := M; P := *(P); 

Pairs := {{M t ,Mj} : M t ,Mj G P and Mi / M,}; 

while Pairs 7^ loop 

Choose {Mi,Mj}, any pair in Pairs; 
Pairs := Pairs \{{Mi,Mj}}; 

Compute a normal form P of S(Mi, Mj) with respect to some 
choice of sequence of reductions modulo P; 

P = NF&(S(Mi,M,-)); 
if P ^ then 

P := PU{P}; P := *(P); 
Pairs := {{Mi,Mj} : M t ,Mj G P and M; £ Mj}; 
end{if }; 
end{loop }; 
return P; 
end{S-Basis Algorithm}. □ 
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Problem 3.9 

Let S = K be a field and R = K\xi, . . ., x n ] be the ring of polynomials 
in the variables x\, . . ., x n over K. 

Given two polynomials /, g € R, we say / is completely reducible by g 
if Hmono(g) divides some monomial m in /. Say m = a- Hmono((/). Then 
we say the polynomial h = f — a ■ g is the complete-reduct of / by g and 
denote the relationship by 

/ — >h. 

If G is a set of polynomials, we write / -^H h if / — > h holds for some g e G. 
If there is a finite sequence hi, . . ., h n (n > 1) such that hi = /, h n = h 

and hi — ^ for i = 1, . . . , n — 1, then we write 

f — >h. 

If / is not reducible by any g e G, we indicate this by writing / — I f. We 
say h is a normal form of f modulo G under complete-reduction (briefly, 

complete-normal form) if / — h — ^ h, and we write NFq(/) for the set 

of all complete-normal forms of / modulo G. 
Show that 

1. The complete-normal form of / is not unique in general. That is, it 
is possible that 

|NF^(/)| > 1. 

2. The complete-normal form of / is well-defined. That is, it is not 
possible that 

|NF^(/)|=0, 

(i.e., the complete-reduction process always terminates). 
Problem 3.10 

Consider the following set G — {gi, . . ., g n +i} Q Q[^V, • • ■, x n ], a 
polynomial / G Q[^f, • • ■, x n ) and the admissible ordering > . 

lex 

xi> x 2 > ■ ■■ > x n . 

lex lex lex 

Let d > and D > be two positive integers. Assume that 

gi = Xi 2^2 2 '3 ' ' ' X n 

92 = x 2 -xf--- x d n 

9n-l = 
9n = 

g n+ i = x n -l 
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and 

Show that there is a complete reduction sequence 

/G : c p G ,c j, G ,c G.c j, 
> J 1 ► J2 — ~* ■ ■ ■ — ~* Jm ? 

such that 

to > 2 ( d+1 ) n " lD . 



Problem 3.11 

Let if be a field and 7? = if [xi, . . ., x n ] be the multivariate polynomial 
ring, as in Problem 3.9. Let /, g e R; we define the ^-polynomial S(f,g) 
of / and 5 as follows: 

(J,9) Hmono(/)' ; Hmono( 5 )' 9, 

where m = LCM(Hterm(/), Htcrm(g)). 

Let i C R be an ideal in R. Show that the following three statements 
are equivalent: 

1. G C I and Hcad(G) = Hcad(i). 

2. G C 7 and for all f e 7, 



3. (G) = 7andforall/, ff eG(/^ 5 ), 



Problem 3.12 

Let if be a field and R = K [xi, . . ., x„] be the multivariate polynomial 
ring, as in Problem 3.9. 

(i) Show that the following two statements are equivalent: 

1. G is a Grobner basis for I. 

2. (G) = 7 and for all / e R, |NF^(/)| = 1. 
Hint: You may need to show by induction that 

f£$g iff / = 5 mod(G). 

(ii) Let (F) C K[x\, . . ., x n ] be an ideal and = mod the usual 
congruence relation on R = K [xi, . . ., x„]. 
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A canonical simplifier for = mod (F) on R is an algorithm G with input 
and output in R such that for all /, g G R, 

f = C(f) mod (F) 

and 

f = gmod(F) C(f) = C(g). 

Notice that the function G gives a unique representative in each equivalence 
class of T I ~. We call G(/) a canonical form of /. 
Devise a canonical simplifier algorithm. 



Problem 3.13 

Let K be a field and R = K[x\, . . ., x n ] be the multivariate polynomial 
ring, as in Problem 3.9. A set G C R is a minimal Grobner basis of the 
ideal (G), if 

G \ {g} is not a Grobner basis of (G) . 

Show that if G and G' are two minimal Grobner bases for the same 
ideal, then they have the same cardinality, |G| = |G'|. 

Hint: Show that (1) the set of head terms in G is equal to the set of 
head terms in G' and (2) no two polynomials in G (or G') have the same 
head term. 



(VgeGj 



Problem 3.14 

Let K be & field and R = K[x\, . . ., x n ] be the multivariate polynomial 
ring, as in Problem 3.9. A basis F C R is self-reduced if either F — {0} or 
else ^ F and 

/ \ r f\{/},c i 

(V/GF) [/ ---^ f 

We call a Grobner basis G C R reduced if 

1. either G = {0} or else for all g € G, Hcoef(<?) = 1; 

2. G is self- reduced. 

(i) Devise algorithmic procedures to compute the self-reduced and re- 
duced Grobner bases of an ideal (F). 

(ii) Prove that the reduced Grobner basis of an ideal in K [ x\ 7 . . . , x n \ 
is unique (relative to choice of the admissible ordering). 

Note: You may want to use the fact that a reduced Grobner basis is 
a minimal Grobner basis. 
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Problem 3.15 

Consider the ideal / = (xy + y, xz + 1) C Q[F, rx, Use the lexico- 
graphic ordering, with 

z < y < x. 

LEX LEX 

(i) Show that the following is a minimal Grobner basis for /: 

G = {xy + y, xz + l,yz- y}. 

(ii) Prove that that G is also a Grobner basis for / (with respect to 
degree ordering of x) when it is considered as an ideal in the polynomial 
ring (Q[F, rv])[^] with variables x and coefficients in Q[F,rx]. 

(iii) Show that G is not a minimal Grobner basis for I{(Q[F, ^])[^]}- 
Compute a minimal Grobner basis G' C G for I{(Q[F, ^])[^]}- Is G' also 
a Grobner basis for I{Q[F, rx, ^]}? 

Hint: Show that 

Hmono x {xy + y) = xy 

= V{xz) - x(yz - y) 

= y HmonOj; (xz + 1 ) — x Hmono^ (yz — y), 

where Hmono x (f) = head monomial of / e (Q[F, when / is 
treated as a univariate polynomial in x. After throwing (xy + y) out of 
G, you can obtain a minimal Grobner basis. 

Problem 3.16 

Consider the polynomial ring R = S[x\, . . ., x n ]. The homogeneous 
part of a polynomial / £ R of degree d (denoted fd) is simply the sum of 
all the monomials of degree din f. A polynomial is homogeneous if all of its 
monomials are of same degree. An ideal / C R is said to be homogeneous, 
if the following condition holds: / e I implies that for all d > 0, fa £ I 

(i) Prove: An ideal I C R is homogeneous if and only if I has a basis 
consisting only of homogeneous polynomials. 

(ii) Given an effective procedure to test if an element / e R belongs to 
an ideal / C R, devise an algorithm to test if / is homogeneous. 

(iii) Let I C R be a homogeneous ideal, and G a Grobner basis for I 
(under some admissible ordering > ) . Define 

A 

G'd = |.9<2 : 9d is a homogeneous part of some g e G of degree dj, 

and 

+oo 

G'= (J G' d . 

d— — oo 
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Prove that G is a homogeneous Grobner basis for I (under > ) . 

A 

(iii) Let K = field. Show that an ideal I C K[xi, . . ., x„] is homoge- 
neous if and only if it has a reduced Grobner basis, each of whose element 
is homogeneous. 



Problem 3.17 

Let G be a homogeneous Grobner basis for I with respect to > , in 

TRLEX 

S[x\, . . ., x n }. Define G^ as follows: 

Let g £ S[xi, . . ., x n ] be a polynomial, and d £ N a positive integer. 
Then 



9 {d) = < 



, if q is divisible by x™ , 



but not by x™ +1 , for some < m < d, 



otherwise. 



and 



Show that G^ is a homogeneous Grobner basis for / : (x^) with respect 



to > . 

TRLEX 



Problem 3.18 

We define a polynomial expression over the ring Z involving n variables 
Xi, X2, . . ., x n as follows: P = 1, P = Xi (i £ {1, . . . ,n}) are polynomial 
expressions; if P and Q are two polynomial expressions, then so are a\ ■ 
P + a-i ■ Q and a\ ■ P ■ Q (ai, 02 £ Z). 

Example: (x 2 — y 2 ) — (x + y)(x — y — 1). 

With each polynomial expression, P, we associate the polynomial ob- 
tained by expanding the polynomial into the simplified form and call it 
P — for instance, the polynomial associated with the expression in the ex- 
ample is x + y. 

Given a polynomial expression Pi, let d\ be the degree of x\ in Pi, and 
P2X1 1 be the corresponding term in P\ (considered as a polynomial in xi 
over the ring Z[^jz£, . . . ,^ K ]); and let di be the degree of x 2 in P 2 , and so 
on, up to d n . 

(i) Suppose P is not identically zero. Let Ii C Z be an interval in Z 
(i £ {1, . . . , n}). Show that in the set 7i x I 2 x • • • x 7„ C Z K , P has at 
most N real zeroes, where 



W -|/.x/,x...x/.|(* + * + ... + *) 
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(ii) Let P be a polynomial expression, not identically zero, and C > 1 
a constant. Let I = I\ = ■ ■ ■ = I n be intervals in Z such that |7| > C ■ 
deg P. Show that the probability that P evaluates to zero at a (uniformly) 
randomly chosen point in I n is bounded by C _1 from above. 

(iii) Devise a probabilistic algorithm to test whether a given polynomial 
expression is identically zero. 
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Solutions to Selected Problems 

Problem 3.1 

Lemma: Given I = ■ ■ ■ ,/ s ) is an ideal in K\x), where K is a field, 

if f = x n + a\x n ~ x H + a n is a monic polynomial of minimal degree in 

I, then G = {/} is a Grobner basis for I. 



Note first that / has the minimal degree among the polynomials of I, since 
if there exists an /' G 7, deg(/') < deg(/), then /' /Hcoef(/') G I would 
contradict our choice of /. 

Since {/} C 7, it is sufficient to show that Head(I) C Head({/}). For 
any g — b x m + b 1 x m ~ 1 + ■ ■ ■ +b m G I, Hmono(<7) = boX m , and n < m by 
choice. Thus, Hmono(/) | Hmono(g), and Hmono(<7) G Head({/}). □ 

Case s = 2: Both head reductions and 5-polynomials do the same 
thing and correspond to one step of polynomial division. Suppose = 
a x n + a\x n ~ x + ■ ■ - + a n , and fa = b x m + b\x m ~ l + ■ ■ - + b mi where n < m; 
then the following are all equivalent: 



Because the quotients are restricted to monomials, we get only one step of 
a complete polynomial division of the type occurring in Euclid's algorithm. 
We observe that it is never necessary to multiply both /j and by a 
power product to find the LCM of the head monomials. 

When we run the Grobner basis algorithm starting with two univariate 
polynomials, the 5-polynomial computations generate remainders, which 
then get reduced to normal-forms. The normal-form computations apply 
head reductions which again compute remainders. The algorithm may be 
viewed as a "disorganized" Euclidean algorithm, in which remaindering is 
done in a nondeterministic fashion. As soon as the g.c.d. of the inputs ap- 
pears (as it must, since we can simulate the Euclidean g.c.d. computation 
by making the right nondeterministic choices), then all the normal forms 
of 5-polynomials necessarily reduce to zero, and the algorithm terminates. 
As we are not trying to produce a "reduced" Grobner basis, it will contain 
along with the g.c.d. also the input polynomials and all the reduced re- 
mainders generated along the way. The g.c.d. can be extracted by simply 
searching for the lowest-degree polynomial in the basis. 



PROOF. 





fi * S(fi, fi-l). 
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Problem 3.2 

(i) In general, for / G R, NF^(/) is not unique. For instance, if we 
head- reduce x 6 y 6 + xy 2 + x+1, first using x 2 + 1 as long as possible and 
then using y 3 + 1, we get xy 2 + x. On the other hand, if we head-reduce 
the same polynomial, first using y 3 + 1 as long as possible and then x 2 + 1, 
we get xy 2 + x 2 + x + 1. Thus, 



Let f el, and /' € NF^(/). Thus, /' G 7. If /' ^ then Hmono(/') G 
Head(G), and /' is head-reducible. As /' is a normal-form, /' must be 0. 
Therefore, NF^(/) = {0}. 

(ii) Let G m in be minimal Grobner basis, which is not a self-head- 
reduced Grobner basis. Then there is a nonzero g G G m ; n which is head- 
reducible modulo G m i n \{g}. Thus, Hmono(g) G Head(G m i n \ {g})- There- 
fore, 

Head(J) = Head(G) = Hcad(G min \{ 3 }) + (Hmono(. 9 )) = Head(G min \{ 3 }), 

and G m ; n \ {g} $1 G m ; n is a Grobner basis for I, which contradicts the 
minimality of G m j n . 

Conversely, let G s h r be a self- head- reduced Grobner basis, which is not 
minimal. Then there is a nonzero g G G s hr such that G' = G s hr \ {<?} is a 
Grobner basis for I. But then 



and g is head-reducible modulo G s h r \ {<?}, which contradicts the self-head- 
reducibility of G s h r - 

Problem 3.5 

Let S be Noetherian, computable and syzygy-solvable ring. If S is 
detachable, then obviously it is also 1-detachable, as 1-detachability is a 
special case of detachability. 

Claim: Let f\, . . ., f r , s € S, and let the syzygy-basis for {f\, . . ., f r , s}, 
bevT, . . ., v^, where, for i = 1, . . ., p, vl — (wis, . . . , Wi, r +i) G S r+1 . Then 

1. 1 — ui wi^ r+ i + ■ ■ ■ + Up Wp ir+ \, for some Ui G S 



NF%(x 6 y 6 + xy 2 + x + I) > 1. 



Hmono(g) G Head(I) = Head(G') 



v 



=> S = tl fl H \-U f, 



where U = — 




2- leKr+iv.V+i) «e (/i,...,/ r ). 
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Proof of Claim The proof is as follows. Let 1 = m wi, r +i+- • -+u p Wp, r +i- 
Since, for all j — 1, . . ., p, 

Wj,i fi H h Wj >r f r + Wj,r+1 S = 0, 

we have 

Ul (wi,l ,/lH h Wl,r /r + Wl,r+1 H 

+ Up (w Pj i fi H h iu Pir / r + Wp, r+ i = 

p p p 

=> *^2 u i w i,i /H 1- X! Mi Wi > r f r + X] Ml "V+i s = 



=i »=i 
-*i /i t r / r + s = 

= h fl + ' ' ' + U f r - 



Thus 



1 e (tOl, r+ l, . . . ,Wp, r +l) S € (/l, . . . ,/ r ). 

Conversely, assume that s£ (fi, . . . , f r ). Thus, s = ti /i H h £ r / r , and 

— /i — • • • — t r f r + s = 0. Since Ui, . . ., Up~ is the syzygy-basis, we can 
find ui, . . ., Up such that 



-t r , 1) = ui vi 



lip Up . 



Thus, Ul Wi.r+l + ■ ■ - Up W Py r+l = 1. HcnCC, 1 G (tUi jT . + i, . . ., Wp.r+l)- 

(End of Claim.) □ 

Let us assume that S is 1-detachable. Let fi, . . ., f r , s G 5, and let 
the syzygy-basis for {/i, . . ., / r , s}, be UT, ■ • ., v^, as before. (The syzygy- 
basis can be computed as S is a syzygy-solvable ring.) If 1 ^ (ioi, r +ij • • ■, 
Wp.r+i), then s £ (fi, . . ., f r ). Otherwise, we can express 1 as 

1 = ui Wi, r+ i H h Up Wp, r+ i, 

using the 1-detachability of S. But by the claim, we see that 

S = ti fi H ht r /r, 

where U = — Y^j=i u j w j,i- Thus 5 is detachable. 
Problem 3.6 

(1) => (3): See the application section (§3.7), in particular the sub- 
sections on intersection (Subsection 3.7.4, pp. 106) and quotient (Subsec- 
tion 3.7.5, pp. 107). 

(3) => (2): Note that 

(Fi) ■ (F2) = ((Fi) : £ (fi)) = fl : (fi))- 

\ j=r+l I j=r+l 
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Also, for each fj(j = r + l,...,s),if Cj is a basis for (Fi) n (fj), and Dj 
is a basis for ann(/ J ), then 

Bj = | : r- • r.j /.), 

is a basis for (Fi) : (fj). As 5 is intersection-solvable and annihilator- 
solvable, all the Bj's can be computed; and as S is intersection-solvable, a 
basis for (Ft) : (F2) can also be computed. Thus 5 is quotient-solvable. 

(2) => (1): Let F» = . . .,6^p.} be a basis for thc idcal (°) : 

(/ a ) = ann(/ s ). 

Let C S s be a set of s-tuples given by 

{ (o___36i:i), 

s-l 

(0 1 _^ 1 _0,6il) }. 

s-l 

Note that both F» and are constructible as the ring S is assumed to 
be quotient-solvable, and for each CD = (0, . . . , 0, 6^]) € S^, 

s-l 

0/i + -"+0/._i+6i'j / s = 0. 

Hence we . . ., / s }). 

Now, for r (1 < r < s), let F» = jfc^, . . . , 6^ r | be a basis for the 

ideal (f r+1 , . . . , f s ) : (f r ). 

Assume that for each j £ {1, . . ., p r }, 

b% fr = -b% ltj fr+1 6jj /s € (jV+1, • • • , / s ), 

i.e., 

^^+^ lJ /r+l+"-+6S/, = 0. 
Let 23^ C 5 s be a set of s-tuples given by 

I /o 6 (r) 6 (r) 6 (r) \ 

r-1 

^ U, ■ „ , U , O r ,p r , °r+l,p r ' • • • ) °s,Pr / J- 
r-1 
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Note that can be computed as S is quotient-solvable; 6^+1, j' ■ ■ ■' "a,j 
(1 < j < p r ) can be computed as S is detachable. Thus, finally, the set B^ 
is constructible for a Noethcrian, computable, detachable and quotient- 
solvable ring S. Also, for each Q = (0, . . . , 0, bf], ft^ij. ■ ■ ■> 6 S> e B(r) > 

r-l 

/H h / r _!6g /r + /r+l + • • • + /- = °' 

Hence we S({f u . . ., /,}). 

Now, we claim that the set B = B {1) U B [2) U • • • U B {s) is in fact a 
syzygy basis for {/1, /2, . . ., /«}. We have already seen that if w G then 
w G S({fi, ■ ■ ., /«})■ Thus, it remains to be checked that every s-tuple 
(ci, C2, . . . , c s ) £ S s satisfying the condition 

ci fi + c 2 h H h c s ,/ s = 

can be expressed as a linear combination of the elements of Z?. Assume to 
the contrary. Then there is an s-tuplc 7 = (0, . . . , 0, c r , . . . , c s ) (r possibly 

r-l 

1) in S({f\, . . ., f s }), not expressible as a linear combination of the elements 
of B; assume that 7 is so chosen that r takes the largest possible value. 
We first notice that, since 

c r fr c r+l /r+l ' c s fs ^ (/r+l} ■ • ■ > /s), 

it follows that c r e (/ r +i, ■ ■ ■ , fs) ■ (fr)- Thus 

c r = mi 6^ H h u Pr b^. 

Now, consider the s-tuple 7' = (0, . . . , 0, c' r+1 , . . . , d s ), where 

r 

C 'r+1 = c r+l - (ui b { j2 1A + ---+U pr 6^ 1)Pr ) , 

4 - ^-(uiftg + .-. + u^feWj. 

But 

4+1 /r+l H 1" c' s fs 

= Cr fr + Cr+1 /r+l H h C s / s 

= 0. 
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Thus 7' € S({fi, . . ., f s })- Since 7 is not expressible as a linear combination 
of the elements of B, and since (7 — 7') is a linear combination of the 
elments of B^ C B, the s-tuple 7' itself cannot be expressed as a linear 
combination of the elements of B. But this contradicts the maximality 
of the initial prefix of O's in the choice of the s-tuple, 7. Indeed, every 
element of S({fi, . . ., f s }) must be expressible as a linear combination of 
the elements of B, and B is a syzygy basis for {/1, . . ., /„}, as claimed. 
Thus the ring S is syzygy-solvable. 

Problem 3.7 

(i) We first prove the statement in the hint: 

ce (ti u (i-t)i 2 ){S[t}}nS 

i=0 i=0 i=0 

k 

c=(a k - b k )t k+1 + ^(fli-i + h - bi-^f + b eS 
i=i 

a k -b k = a k -i + b k - b k -i = ■ ■ ■ = ao + b\ - bo = 0, 
and c^doeS 

bo = ao + ai H h a k = c G S 

c e Ii and c£/ 2 

^ cehni 2 . 

Therefore, contraction-solvability of £ implies intersection-solvability of 5*. 

(ii) It was shown in Section 3.3 that the strong-computability of S 
implies that a Grobner basis G for (F) C S[t] can be computed with 
respect to the admissible ordering > (in this case, it is simply the degree 

LEX 

ordering) . We now claim that G D S is a Grobner basis (also finite) for the 
contraction of (F), (F){S[t]} n S (in S). Thus if S is strongly computable 
then S is contraction-solvable. 

To justify the claim, we make the following observations: 

(v / e S[t]) Hmono(J) eS f e S 

(i.e., if the highest-order term in / does not involve t, then no term of / 
involves t, and vice versa). 
Thus, 



Hcad(G n S) = Hcad(G) n5 = Head(J) n S = Head(7 n S). 
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Problem 3.9 

(i) Let G = {x 2 - 1, x 2 - x} and let / = x 2 , then NF£(/) 2 {1, a:}, 
showing that |NF^(/)| > 1. 

(ii) In this case, we have to simply show that the complete-reduction 
process always terminates. We start with a definition. Let X be any set 
with a total ordering < and let S(X) be the set of all finite decreasing 
sequences of elements of X: 

S(X) = {(xi, . . . ,x n ) : Xi e X,xi > x 2 > ■ ■ ■ > x n } . 

Let S(X) have the following induced total-ordering: 

(xi, ■ ..,x n ) <' (j/i, . . .,y m ), 

if either for some i < min(n, m), x\ — yi, ■ ■ ., Xi — yi and x i+ i < j/j+i, 
or else the sequence (xi, . . ., x„) is a prefix of the sequence (yi, . . ., 
(thus, n < m). 

CLAIM: If X is well-ordered by <, then S(X) is well-ordered under the 
induced ordering. 

Proof of Claim For the sake of contradiction, suppose <j\ >' o"2 >' ■ ■ ■ is 
an infinite descending chain in S{X}. Let o~{ = (x^i, . . ., Xi n ^ ). There 
are two cases. 

(i) The n(i)'s are bounded, say k = max{n(i) : i = 1,2, • • •}. We use 
induction on k. We get an immediate contradiction for k = 1, so assume 
fc > 1. If there are infinitely many i's such that n(i) = 1, then we get a 
contradiction from the subsequence consisting of such crj's. Hence we may 
assume that the n(i)'s are all greater than 1. Now there is an i such that 
for all i > io, x it i = x i+ i^. Let a[ = {x it2l ■ ■ ■, ar»,n(»)) De obtained from 
&i by omitting the leading item in the sequence. Then the sequence a\ , 
Oi +i, ••• constitutes a strictly decreasing infinite chain with each a\ of 
length < fc. This contradicts the inductive hypothesis. 

(ii) The n(i)'s are unbounded. By taking a subsequence if necessary, 
we may assume that n(i) is strictly increasing in i. Define m(l) to be the 
largest index such that x m n\i — Xj.i for all j > m(l). For each i > 1 define 
m(i) to be the largest index greater than m(i — 1) such that x m ^ t i = Xj_i 
for all j > m(i). Note that the sequence 

x m(l).l, X m (2),2i X m(3),3> ■ ■ ■ 

is strictly decreasing. This contradicts the well-foundedness of X. 
(End of Claim.) CJ" 

Now to see that the complete-reduction process terminates, we pro- 
ceed as follows: We map a polynomial g to the sequence of monomials 
g = (mi, . . ., mk), where m^s are the monomials occurring in g and 
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mi > 7712 > 



>mfc. By our claim, the set of <?'s are well-ordered un- 



G,c 



dcr the induced ordering >'. It is seen that if g — '-+h, then g>'h. The 

A "A 

termination of the complete- reduction is equivalent to the wcll-foundedness 
of the induced ordering. 



Problem 3.11 

(1) => (2): 



G C I and Hcad(G) = Head(J) 

=> (v/el) 



f G 2 H Q 



(V/€J) 



, . ,G,h 

(since / — ► g 



, G,c . 

/ — ><?•) 



(2)^(3): (i) 

(v / e / 



which implies / C (G) but G C J, therefore (G) = J. 
(ii) 



(v/,seG,/?tff) 



777 



/ 



m 



Hmono(/) ' Hmono(g) 



9 G / 




(3) (1): As (G) = I, G C I. Let, for each F C G, SP{F) stand for 
the set of S-polynomials of F. 



(v/,jeG,//j) [S(f,g)^0 
=> (V F C g) (V fr F e SF(F)) 
(V F C g) (V h F e S'F(F) 



/7 F ^>0 



such that 



Hterm(/7 F ) > Hterm(/ i g i ) for all i 

A 

=> G satisfies the syzygy-condition. 
It then follows that Head(G) = Head(J). 



Problem 3.12 

(i) (1) (2): Let g,g' e NF£(/). Then f — g € I and / - g' e J 
and therefore g — g' & I. Then by previous part 5 — 5' — ^ 0. But 5 — 5' is 
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in normal form with respect to complete- reduction as g, g' are themselves 
in normal form. Hence g — g' = 0, and g = g'. Since, for all / G i?., 
|NF G (/)| > 0, we see that |NF G (/)| = 1. 

(2) =* (1): 

We begin by proving the statement in the hint. 
CLAIM: For all G C i?, and /, 5 G i?, / = g mod (G) i/ and only if 

, G,c 

f< — >g- 

* 

Proof of Claim 

(<=) 

This is easily shown by induction on the number of steps between / and g. 
Let 

„ G,c G.c G,c 

J = 9o < * .91 ■ ■ ■ < * 9k = 9 

for some k > 0. The result is trivial for k = 0. Otherwise, by induction, 
51 — gk <S (G) and it is seen directly from the definition that g — g\ G (G). 
Thus g = g k mod (G). 

(=>) 

If / — g G (G), then we can express / — g as Y^ILi a iUfi, where each 
a, G if, and ij is a power-product and the /j are members of G. If m = 
the result is trivial. If m > 1 then we can write g' = g + a m f m and 

f — g' = Yll^i a itifi- By induction hypothesis, /<— ^<7 ; . We also have 

G c 

that </ — g = a m t m f m — ^ 0. Let t = Htcrm(i m / m ). It is clear that t occurs 
in g and g' with some (possibly zero) coefficients a and a' (respectively) 

such that a m = a' - a. Thus g^(g- at m f m ), and g' ^{g 1 - a't m f m ), 

i.e., 5<^>.g'- This shows / <— > .g' <— > ,g. 

(i?nd 0/ Claim.) IZI 

Now, going back to our original problem, we see that since (G) = I, 
G C 7. Furthermore, 

/€/ / = 0mod(G) 

=► /^0 

G,c G.c G.c 
/ = .90 .9i < — ► ' ' ' < — > 9k = 

V0<Kfc) [NF&Gfc) n NF G ( ffm ) ^ 
NF G (/) = NF G ( 5i ) - NF G ( ffm ) = NF c G (0) = 0, 
for alH, < i < k, 
(since \NF c G (gi)\ = 1, for all i, < i < k) 

=> f^o. 

Hence GC/, and for all / G /, / 0, and G is a Grobner basis for /. 
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(ii) Let G be a Grobner basis for (F). Then use the algorithm that 
on input of / e K[x\, . . ., x n ] produces its normal- form (under complete 
reduction) modulo G, i.e., C(f) — NFq(/). The rest follows from the 
preceding part and the well-known properties of Grobner bases. 



Problem 3.14 

(i) The following two routines terminate correctly with the self-reduced 
and reduced Grobner bases of an ideal (F). 



SelfReduce(F) 

Input: F a finite set of polynomials in K[xi, . . ., x n ]. 
Output: R a self-reduced basis for (F). 

loop 

R :— 0; self-reduced := true 
while F =fi loop 

Choose / from F and set F := F\ {/}; 
9 :=NF^ UF) (/); 
if g ^ then R := Ru {g}; 
self-reduced := (g — /) and self-reduced; 
end{while } 
F := R; 
until self-reduced 
end{loop }; 
return (R); 
end{SELFREDUCE}. □ 



The routine SelfReduce terminates and is correct. 

In each iteration of the inner while- loop (except for the terminating it- 
eration) there is a selected polynomial / that must be subject to a reducing 
transformation, i.e., g ^ NF^ ui? ^(/). If / = / , /i, . . ., constitute the suc- 
cessive transformed versions of /, then it is easily seen that the sequence 
of /j's is finite. Since this is true for every / in the original F, there can 
only be a finite number of iterations. 

The correctness follows trivially from the definition, once we observe 
that the ideal (F U R) remains invariant over the loops. 
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Reduce(F) 

Input: F a finite set of polynomials in K[xi, . . ., x n ]- 
Output: G a reduced Grobner basis for (F). 

G ■— SelfReduce(F); 

B:= {{/,<?] :./•// G,f ■• //J: 

while loop 

Choose {f,g} to be any pair in B; 

B:=B\{{f,g}}; 

h:= S(f,g); 

h! := NF^(ft); 

if h' then 

G := SelfReduce(G U 
/,': !!././/!:/.;/ (.'../ - <?}; 
end{if }; 
end{while }; 
for every / G G loop 

G:= (G\{/})U{//Hcoef(/)}; 
return (G); 
end{REDUCE}. □ 

T/ie routine Reduce terminates and is correct. 

The termination and the fact that the output G of the routine is a 
Grobner basis of (F) can be proven easily in a manner similar to the proof 
for the Grobner basis algorithm. (Use the ascending chain condition and 
syzygy condition.) When the algorithm terminates, clearly the basis is 
self-reduced and each element of the basis is monic. 

(ii) 

Theorem The reduced Grobner basis of an ideal in K [x\, . . ., x n ] is 

unique (relative to the choice of an admissible ordering). 

PROOF. 

Let G, G' be two reduced Grobner bases for the same ideal. We obtain a 
contradiction by supposing that there is some polynomial g in G — G' . By 
the preceding problem, there is some other polynomial g' in G' — G such 
that Hmono(g) = Hmono(g') [recall that Hcoef(g) = 1 = Hcocf(c/')]. 
Let h — g — g'. Then 

h^O and h^O, 

since G is a Grobner basis. So some term t occurring in h can be eliminated 
by a complete- reduction by some / € G. Now t must occur in g or g' . If t 
occurs in g, then g is reducible by /, contradicting the assumption that G is 
reduced. If t occurs in g' then let /' <G G' such that Hterm(/') = Hterm(/). 
Again g' is reducible by /', contradicting the original assumption that G 1 
is reduced. □ 
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Problem 3.18 

(i) We can write P(xi, . . ., x n ) as 



P{X) = P{xi, ...,x„) = Pd 1 (x2, ■ ■ ..inK 1 H h Po(a;2, ■••,£„) 

If P(X) ^ 0, then for a fixed value of X, P(X) can be zero because of two 
reasons: (a) x\ is a root of the univariate polynomial P{X) with Pj(x2, 
. . ., x n ) (0 < j < di) as coefficients, or (b) for each j (0 < j < d\), Pj(x2, 
. . ., x n ) — 0. For a fixed value of X2, ■ ■ ., x n , there are only d\ zeroes of 
P(X) and each Xj can assume only one of the \Ij\ values; therefore, there 
arc at most d\ ■ I1J=2 Vj\ zeroes of the first kind. The total number of 
zeroes of the second kind are obviously bounded by the number of zeroes 
of Pd 1 (x2, ■ ■ ., x n ). Let Z[P(X)\ denote the zeroes of P(X). Then we have 
the following recurrence: 

n 

\Z[P(X)}\ < di- niij-l + \Z(P dl (x2,...,x n ))\-\h\ 

3=2 



Therefore, 



\Z(P(X))\ \Z(P dl (x 2 ,...,x n ))\ di 

— -i—r n i -r i ~r 



nu\i 3 \ - hum 

< 



Lin 

j=i 1 jl 



which gives the required inequality. 

(ii) Since there are at most JJj=i l-^'l total possible values of P, the 
probability 

p = Pt[P(X) = : P^0] < |Z(P)I 



n?=iW 

If |7j| = |/| (1 < j < n), then the probability is bounded by 

p = Pr[P(X) = 0:P^0] < = |7|E^ = ^ de S( P ) 

Therefore, if |7| > C • deg(P), then p < C" 1 . 

(iii) One possible way to check if P = is to choose xi, . . ., x n randomly 
and evaluate P at this set of values. If P(X) ^ 0, then obviously P ^ 0; 
otherwise, return with P = 0. If the algorithm returns P = 0, then the 
probability of error is bounded by p. Therefore, if we choose |/| = [I ■ 
deg(P)], then the algorithm returns the correct answer with probability 
> 1 — e, for any < e < 1. 
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However, this algorithm is not very practical because \I\ may be very 
large, in which case all sorts of computational problems arise. One simple 
solution is to evaluate the polynomial for several sets of values, instead of 
evaluating it for only one set of values. However, we need to repeat the 
steps only if P(X) = 0. Therefore, the modified algorithm works as follows. 
Repeat the following steps k (where k is a fixed number) times: 

1. Choose randomly x\, . . ., x n in the range |/|. 

2. Evaluate P at this set of values. 

3. If P ^ 0, then return with P ^ 0. 

Now the probability that P evaluates to zero, all of k times, even though 
P ^ 0, is at most C~ k , provided that |/| > C-deg(P). By choosing, C = 2 
and k — [log |] , we can ensure that the algorithm is correct with the 
probability at least 1 — e. If P has m terms, then the running time of the 
algorithm is bounded by 0(kn(m + deg(P))). In order to ensure that the 
probability of error is o(l), we may choose k = O(logn); then the running 
time of the algorithm is 0((m + deg(P))nlogn), and the probability of 
correctness becomes 1 — ( Q(1) ) . 

Another way to reduce the range of \I\ is to use modular arithmetic. In 
particular, we can perform all calculations modulo q where q is some prime 
number. 
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are several excellent survey papers exposing these ideas in depth, for instance, 
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and Kapur [110], Kapur and Narendran [115], Lankford [126], Pan [159], Schaller 
[182], Shtokhamer [190], Szekeres [199], Trinks [202] and Watt [207]. Additional 
related materials can also be found in a special issue on "Computational Aspects 
of Commutative Algebra" in the Journal of Symbolic Computation (Vol. 6, Nos. 2 
& 3, 1988). 

The question of degree bounds and computational complexity for the Grobner 
basis in various settings is still not completely resolved. However, quite a lot is 
now known for the case when the underlying ring of polynomials is K[xi, . . ., 
x n ], where K = a field. Let D(n,d), I(n,d) and S(n,d) denote the following: 

1. D(n,d) is the minimum integer such that, for any ordering and for any 
ideal I C K[xi, . . ., x„\ generated by a set of polynomials of degree no 
larger than d, there exists a Grobner basis whose elements have degree no 
larger than D(n, d). D'(n, d) is a similar degree bound for the special case 
where the ordering is assumed to be degree-compatible. 

2. Similarly, S(n,d) is the minimum integer such that, for any set of poly- 
nomials {g\, . . ., g m } C K[xi, . . ., x„\, all of degree no larger than d, the 
module of solutions of the following equation: 

hi gi H + h m g m = 

has a basis whose elements have degree no larger than S(n, d). 

3. Finally, I(n, d) is the minimum integer such that, for any set of polynomials 
{gi, . . ., g m } C K[xi, . . ., x„] and a polynomial / G (gi, . . ., g m ), all of 
degree no larger than d, the following equation 

hi gi H +h m g m = f 

has a solution of degree no larger than 7(n, d). 
Following summary is taken from Gallo[76]: 

1. Relationship among D(n,d), D'(n,d), I(n,d) and S(n,d): 

(a) S(n,d) < D(n,d) (Giusti[82]). 

(b) S(n,d) < I(n,d) < S(n,d) 0( - n) (Lazard[129]). 

(c) D'(n,d) < D(n,d) (Yap[214]). 

2. Upper bounds for 7(n, d) and S(n, d): 

(a) S(n,d) < d + 2(mdf n ~ 1 (Hermann[93] and Seidenberg[187]). 

(b) I(n,d) < 2(2d) 2 " _1 (Masser and Wustholz[140]). 

(c) S(n,d) < d 2(>o«3/io g 4)„ ( Lazard [ 129 j) 

(d) I(n,d)<d (Lazard[129J). 

3. Upper bounds for D(n, d): 

(a) D(n,d) — 0(d 2 ( ' ) for homogeneous ideals in generic position 
(Giusti[82]). 

(b) D(n, d) < h 2 , where h — the regularity bound is no larger than d 2 
(Moller and Mora[150] and Giusti[82]). 
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(c) D(n,d) < d 2 (Dube[66]). 
4. Lower bounds D(n,d), D'(n,d), I(n,d) and S(n,d): 

(a) D(n,d> D'(n,d) (Yap[214]). 

(b) D'(n,d) > ft(d 2 ") (Moller and Mora[150] and Huynh[103]). 

(c) I(n,d) > d 2 " and S(n,d) > d 2 " , where n' « ra/10 (Mayr and 
Meyer [143] and Bayer and Stillman[18]). 

(d) I(n,d) > d 2 "" and S(n,d) > d 2 "" , where n" « n/2 (Yap[214]). 
One interesting open question is to close the gap in the following bounds: 

d < S(n, d) <d 

However, much less is known for the case when the underlying ring of polynomials 
is over the integers Z[r^i?, x„]. Let D(n,d) and I(n,d) denote the degree 
bounds in this case as earlier. Gallo and Mishra[79] have recently shown that 

D(n,d) < F4 n+ s(l + max(n, c, d, m)) and I(n,d) < F4 n+ s(l + max(n, c, d, m)), 

where c and d are the coefficient and degree bounds, respectively, on the input 
polynomials and Fk is the fc th function in the Wainer hierarchy. Note that if n, 
the number of variables, is assumed to be fixed, then these are primitive recursive 
bounds. 

The set of applications discussed in this chapter are taken from the papers 
by Buchberger [33], Gianni et al. [81], Spear [193] and some unpublished course 
notes of Bayer and Stillman. 

Problems 3.4 and 3.10 are based on some results in Dube et al. [64]; Prob- 
lem 3.7 is taken from Gianni et al. [81]; Problem 3.8 is from Gallo and Mishra 
[79]; Problems 3.9, 3.11, 3.12 and 3.14 are based on the results of Buchberger [33] 
(also see Mishra and Yap [149]) and Problem 3.18 is due to Schwartz [183]. 

In this chapter, what we refer to as a Grobner basis is sometimes called a 
"weak Grobner basis" in order to differentiate it from a stronger form, which has 
the following two properties (and depends on the associated reduction r): 

G C R is a "strong Grobner basis" of (G), if 



1. V/G(G) 



2. V/Gi? 



NF r G (f) = 



|NFH/)| = 1 



and 



That is, every element of the ring reduces to a unique normal form, which is 
0, if additionally the element is in the ideal. The Grobner bases defined in this 
chapter only satisfy the first condition. Existence, construction and properties of 
strong Grobner bases have been studied extensively; see Kapur and Narendran 
[115]. 
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Solving Systems of 
Polynomial Equations 

4.1 Introduction 

The Grobner basis algorithm can be seen to be a generalization of the clas- 
sical Gaussian elimination algorithm from a set of linear multivariate poly- 
nomials to an arbitrary set of multivariate polynomials. The S'-polynomial 
and reduction processes take the place of the pivoting step of the Gaussian 
algorithm. Taking this analogy much further, one can devise a construc- 
tive procedure to compute the set of solutions of a system of arbitrary 
multivariate polynomial equations: 

fi(xi, . . . ,x n ) = 0, 
f 2 (x 1} ...,x n ) = 0, 

fr{x\ i • • • , Xn) — 0, 

i.e., compute the set of points where all the polynomials vanish: 
{<&, • ■ • ,U : mi, ■ ■ ■ ,£n) - 0, for all 1 < i < r}. 

In this chapter, we shall explore this process in greater details. 

Just as the Gaussian algorithm produces a triangular set of linear equa- 
tions, the Grobner basis algorithm under the purely lexicographic ordering 
also produces a triangular set of polynomials, where the concept of a trian- 
gular set is suitably generalized. Roughly speaking, the constructed basis 
can be partitioned into classes of polynomial systems, where the last class 
involves only the last variable, the class before the last involves only the 
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last two variables, etc., and each of these classes also satisfies certain ad- 
ditional algebraic properties in order to guarantee that the set of solutions 
of the last k classes contain the set of solutions of the last (k + 1) classes. 
At this point, it is not hard to sec how to obtain the set of solutions of 
the original system of polynomial equations by a simple back substitution 
process: first solve the last class of univariate polynomials; then substitute 
each of these solutions in the class of polynomials immediately preceding 
it, thus obtaining a set of univariate polynomials, which can now be easily 
solved, and so on. 

These intuitions have to be clearly formalized. We shall first define 
the concept of a triangular set in the general setting and study how such 
a triangular set can be computed using the Grobncr basis algorithm of 
the previous chapter. After a short digression into algebraic geometry, we 
shall describe the complete algorithms to decide if a system of equations is 
solvable and to solve the system of equations, in the special case when it has 
finitely many solutions. A key lemma from algebraic geometry, Hilbert's 
Nullstellensatz , plays an important role here and will be presented in detail. 



4.2 Triangular Set 

Let G C S[x\, . . ., x n ] be a finite set of polynomials. Let the set G be 
partitioned into (n + 1) classes Go, G\, . . ., G n as follows: 



Go = G n S[X1,X2, ■ ■ ■ ,x n ] \ S[X2, ■ ■ ■ ,x n ] 

= the set of polynomials in G involving the variable X\ , 
and possibly, the variables x 2 , . . ., x n . 

Gi = G H S[X2,X 3 , ■ ■ ■ ,X n ] \ S[X3, ■ ■ ■ ,x„] 

= the set of polynomials in G involving the variable X2 , 
and possibly, the variables X3, . . ., x n . 



G„_! = G n S[x n ] \ S 

= the set of polynomials in G involving the variable x n . 

G n = G n s 

= the set of constant polynomials in G. 



That is, Gi is the polynomials in G that contain x i+ i but do not contain 
any Xj for j < i. 
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Definition 4.2.1 (Triangular Form) The ordered set of polynomials con- 
sisting of polynomials in Go followed by polynomials in Gi and so on up 
to G n is said to be a triangular form of G. □ 

Definition 4.2.2 (Strongly Triangular Form) Let if be a field, and 
G C K[xi, . . ., x n ] a finite subset of K[x\, . . ., x n ]. We say a triangular 
form of G is in strongly triangular form if 

1. (G n ) = {GC\K) = (0), and 

2. For each i (0 < i < n), there is a gi <E Gi containing a monomial of 
the form 

a xf +1 , where a € K and d > 0. □ 

For a given set of polynomials G, if the generators of (G) can be put 
into a strongly triangular form, then we shall show that there is a finite, 
nonempty set of common zeros for the system of polynomial equations 
given by G. While this statement needs to be made formal, first, note that 
every element in the ideal (G) must vanish at every common zero of G. 
Therefore we see that any other basis of (G) must have exactly the same 
set of zeros as G. Intuitively, condition 1 tells us that the set is nonempty 
as otherwise we have that a (nonzero) constant is equal to zero. Condition 
2 tells us that the set has a finite number of common zeros, as the following 
example shows: 

Example 4.2.3 Consider three systems of polynomial equations: G', G" 
and G'" C C[^^,-rv], in their triangular forms: 



G' 


= {x\x 2 


-x 2 , x\ 


-1}, 


G" 


= {x\x 2 


— x 2l x 2 ] 


-, and 


G'" 


= {x\x 2 


- x 2 }. 





Just by looking at the polynomials in G , G ' or G ", we cannot tell 
whether the system of equations will have finitely many zeroes or not. 

• In the first case, x 2 can take the values +1 and —1, since x\ = \. We 
see that, after substituting x 2 = +1 in the equation xix 2 — x 2 , one of 
the common zeroes of G' is (+1, +1), and after substituting x 2 = —1 
in the same equation xix 2 — x 2 , the other common zero is (+1, —1). 
Thus G' has finitely many common zeroes. In fact (G') has a basis 

{xi - 1, x\ - 1}, 

and this basis is in strongly triangular form. 
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• In the second case, x 2 can take the value (with multiplicity 2), 
since x\ = 0. We see that, when we substitute xi = in the equation 
x\xi — X2, the equation becomes identically zero, thus showing that 
for all £ € C, (£, 0) is a common zero. Thus G" has infinitely many 
zeroes. In fact, G" cannot be written in a strongly triangular form. 

• In the third case, we see that for all £, £ e C, G'" vanishes at (£, 0) 
and (1,C)- Again G'" has infinitely many zeroes, and G'" cannot be 
written in a strongly triangular form. D 

Definition 4.2.4 (Elimination Ideal) Let / C S[x\, . . ., x n ] be an ideal 
in the polynomial ring S[xi, . . ., x n }. We define the i th elimination ideal 
of I, li, to be: 

/i=/n%i + i,...,4 (0 < i < n). 

That is, 

Io = I 

h = i n s[x 2} . . . ,x n ] 

= the contraction of / to the subring S[x2, ■ ■ ■ , x n ]. 

/ n _i = ms[x n ] 

= the contraction of / to the subring 5[x„]. 

in = ms 

= the contraction of I to the subring 5*. □ 

Now, given a set of polynomial equations { fi = 0, . . ., f r = 0}, we 
would like to generate a new basis G = {gi, . . ., g s } for the ideal / = (/i, 
. . ., f r ) such that when we consider the triangular form of G, it has the 
additional algebraic property that 

n 

I^J Gj is a basis for the i th elimination ideal I{. 

3=i 

Since D li, we shall see that this implies that the set of solutions of 
the system of polynomials given by Uj=« Gj contains the set of solutions 
of Uj=i-i Gj, as desired. As a matter of fact, we shall achieve a somewhat 
stronger property: our computed basis G will be a Grobner basis of / and 
the set Uj=i Gj will also be a Grobner basis of li with respect to the same 
lexicographic admissible ordering. 

In order to simplify our proofs, we shall use the following generalization 
of lexicographic ordering. 
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Definition 4.2.5 (A Generalized Lexicographic Ordering) Consider 
the ring R = S[X, Y) = S[xi, . . ., x n , yi, ■ ■ ■, Vm]- Let > , > be two ad- 

X Y 

missible orderings on PP(X) and PP(Y), respectively. 

Define the admissible ordering > on PP(X,Y) as follows: 

L 

{if p>p' or 
iip = l/ and 

where p, p' e PP(X) and g, g' e PP(Y). □ 

Theorem 4.2.1 7/G is a Grobner basis for I with respect to > in S[X, Y], 

L 

iften GnS[F] is a Grobner basis for I (1 S[Y] with respect to > in S[Y}. 

Y 

PROOF. 

(1) By the definition of > , we have, for every / e S[X, Y], 

L 

Hmono L (/) e S[Y] & f e S[Y}; 

that is, a polynomial whose head term is in PP(Y) cannot include any 
power product involving Xi £ X. Hence 

Hcad Y (G nS[Y}) = Head L (G n S[Y}) 
= Head L (G) n S [Y] = Head L (I) n S[Y] 

(Since G is a Grobner basis for I) 
= Head L (lr\S[Y}) = Head Y (/ n S[Y\). 

(2) Since GCI, clearly G n S[Y] C I n S[Y}. 

(3) Thus Gn S[Y] is also a Grobner basis for 7{5[y]} with respect to 
> in S[Y]. □ 

In other words, it is easy to find a basis for a contraction if > preserves 

L 

the underlying admissible orderings. A useful corollary to the theorem is: 
Corollary 4.2.2 If G is a Grobner basis of I with respect to > in Slxi, 

LEX 

. . ., x n ] ( assuming X\ > ■ ■ ■ > x n ), then for each i — 0, . . ., n, 

LEX LEX 

1 Gfl S[x i+ i, . . ., x n ] is a Grobner basis for I n S[x i+ i, . . ., x n ] with 
respect to > in S[x i+1 , . . ., x n ] (assuming Xj+i > ••• > x n ). 

LEX ' LEX LEX 

2. Equivalently, Uj=i @j * s a Grobner basis for the i th elimination ideal 
Ii with respect to > . □ 

LEX 

As a result of the preceding corollary, we may simply refer to a Grobner 
basis of an ideal with respect to the purely lexicographic admissible ordering 
as its triangular set. 
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4.3 Some Algebraic Geometry 

From now on, we will only consider polynomial rings L\x\, . . . ,x n ] where 
L is an algebraically closed field. 

Definition 4.3.1 (Algebraically Closed Field) A field L is called al- 
gebraically closed if every polynomial in L[x] splits into linear factors, i.e., 
every nonconstant polynomial in L\x] has a zero in L. □ 

Example 4.3.2 (1) C = Field of complex numbers is algebraically closed. 

(2) M = Field of reals is not an algebraically closed field, since for 
example x 2 + 1 e R[r>] has no real zero. □ 

In general, we could study the geometric properties by considering any 
arbitrary field K. Then, of course, we would have to answer the question: 
what do we mean by a solution? That is, where do the solutions live? For 
instance, we could run into the problem that if we consider the field of 
rational numbers, K — Q, then an equation of the kind x 2 + y 2 + z 2 = 1 
has no solution. The usual remedy is to take solutions whose components 
all lie in an algebraically closed extension of the field K . Sometimes, even 
more generality is necessary: one considers an extension field £1 which is not 
only algebraically closed, but has the additional property that the degree 
of transcendency of Q over K is infinite. The field is called a universal 
domain. However, we have simply opted to ignore these difficulties by 
working over a sufficiently general field. 

We shall use the following notations: 

A = L[xi, . . . , x n ] = Polynomial ring in n variables over L. 
A™ = Affine n-space with coordinates in L, 
(n-tuples of elements in L). 

Definition 4.3.3 (Zero Set, Zero Map) Let F C A be a subset of poly- 
nomials in A. Then the set 



2(F) 



jp e A" : (v/ef) [/(P) = o]}, 



is the zero set of F. 



The map 



Z 



{Subsets of A} -» {Subsets of A™} 
fn Z(F) 



is the zero map. □ 



Some authors also refer to a zero set as a variety. In consistence with 
the accepted terminology, we shall not make any distinction between zero 
sets and varieties. 
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Definition 4.3.4 (Ideal, Ideal Map) Let X C A" be a set of points in 
the affine n-space, A™. Then the ideal 



J(X) = {feA : (vPei) [/(p)=o]} ; 

is the ideal of X. 



( 



Note: 



and 



i.e., 



VPel 



f(P) = A g(P) = 



(We a) [(f-g)(P) = 6 



V Pe A 



f(P) = o 



(v/isi) (vPel) [(/*/) (P) = 



(V /, g e J(X)) (Vftei) [/ - 5 G J(X) and /i/ e J(X) 

Thus, I(X) is in fact an ideal. ^ 
The map 

X : {Subsets of A™} — > {Subsets of A] 
: X i ► J(X) 

is the ideal map. □ 



Definition 4.3.5 (Algebraic Set) A set X C A™ is said to be an alge- 
braic set if A is a zero set of some set of polynomials F C A. 

X = Z{F). □ 

The following proposition shows that the zero set of a system of poly- 
nomials does not change as we augment this set by additional polynomials 
generated by linear combinations of the original polynomials. In other 
words, the geometric problem remains unchanged if we replace the system 
of polynomials by the ideal they generate or if we replace it by another 
system of generators for their ideal. 
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Proposition 4.3.1 Let F C A be a basis for some ideal I C A, i.e., 
(P) = I. Then the zero set of F and the zero set of I are the same. 

Z(F)=Z{I). 

PROOF. 

(1) Since F C /, we immediately see that 



Z(F) D Z{I). 



That is, 



P e z(i) 



V/e/ 



f(P) = o 



=* (v/e f) [f(P) = 

(2) Conversely, let P = {/i, .., f r }. 

PeZ(F) => / 1 (p) = ... = / r (p) = o 

(V / = fcl/l + • • • + Kf T 

=> (v/e /) [/(P) = 



P e z(P). 



/(^) = 

P e Z(J). □ 



Thus, we could have defined an algebraic set to be the zero set of some 
ideal. Note that an empty set is an algebraic set as it is the zero set of 
the improper ideal A = (1) and A" is the zero set of the zero ideal (0). 
Note that it is possible for different ideals to define the same algebraic set. 
Also algebraic sets are closed under set theoretic operations such as union 
and intersection. 

Following properties can be demonstrated trivially; we leave the proof 
as an exercise for the readers: 

Proposition 4.3.2 We have the following: 

1. If I and J are ideals, then 

ICJ => Z(I)DZ(J). 



2. If I and J are ideals, then Z (I) U Z( J) and Z (I) n Z (J) are algebraic 
sets and 



Z(I)UZ(J) = Z(lnJ) = Z(I-J), 

z(i)nz(j) = z(i + j). □ 
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Proposition 4.3.3 Let A = L[x\, X2, ■ ■ ■, x n ] and B — L[x2, ■ ■ ■, x n ] be 
two polynomial rings. Let H be a projection map from A" to A" -1 , defined 
as follows: 

II : A™ — > A"" 1 

If I C A is an ideal in A, and J = I n B a contraction of I to B, then 

U(Z(I)) C Z(J). □ 

Finally, 

Proposition 4.3.4 Let V C A™ and W C A m be algebraic sets. Then the 
product V xW C A™+"\ 

^xi^ = {(a,...,^,Ci,.-.,Cm>eA"+ m : 
(&,...,&,> €V and (Ci,...,Cm) e 

is afeo on algebraic set. Furthermore, 

1(V xW) = IL[xi, ...,x n ,y 1 ,...,y m } + JL[x lt ...,x n ,y 1: .. .,y m ], 
where I is the ideal ofV in L[x\, . . ., x n ] and J is the ideal of W in L[y\, 

■ ■■,y m ]- □ 



4.3.1 Dimension of an Ideal 

Definition 4.3.6 (Dimension) Let / be an ideal in the ring K[x\, . . ., 
x n ], where K is an arbitrary field. Assume that the set of variables x^i) = 
Mi, . . ., x v (i) — ui forms the largest subset of {xi, . . ., x„} such that 

If\K[u u ...,ui] = (0), 

i.e., there is no nontrivial relation among the Mi's. 

Then u\, . . ., ui are said to be the independent variables with respect 
to / and the remaining r = (n — I) variables, x^+i) = vi, . . ., ij(„) = v r 
are the dependent variables with respect to /. 

Also I is said to be the dimension of the ideal I (dim/ = I), and 
r = (n — I), its codimension. □ 

Suppose, now, that the variables xi, . . ., x n are so ordered that the de- 
pendent variables v\, . . ., v r appear earlier than the independent variables 
Mi, . . ., ui in the sequence, and that we write the polynomial ring as K[v\, 
. . ., v r , Mi, . . ., ui] = K[V,U]. As in our earlier definition, consider the 
following generalized lexicographic ordering: 
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Let > , > be two admissible orderings on PP(V) and PP(U), respec- 
tively. The admissible ordering > on PP(F, U) is derived as follows: For 
all p, p' £ PP(V) and q, q' £ PP(U), 

pq > p q if p > p or if p — p and q> q . 

L V " U 

Now by Theorem 4.2.1, we see that if G is a Grobner basis for I with 
respect to > in K[V, U], then Gr\K[U] is a Grobner basis for I(1K[U} = (0) 

with respect to > in K[U]. In particular, if G is a reduced Grobner basis 

then G n K[U] is either an empty set or {0}. 

Thus by considering all possible partitions of the set of variables X into 
two disjoint subsets V and U, and by computing the Grobner basis with 
respect to a lexicographic ordering such that 

v > u 

. LEX . 

one can compute the set of independent variables and hence the dimension 
of the ideal /. 

While the above argument shows that the independent variables and 
hence the dimension of an ideal can be effectively computed, the procedure 
outlined here is not the most efficient; some of the recent results [61] have 
improved the computational complexity of the problem significantly, both 
in sequential and parallel computational domains. 

4.3.2 Solvability: Hilbert's Nullstellensatz 

Next, we shall develop a key theorem from algebraic geometry: Hilbert's 
Nullstellensatz . Using this theorem, we shall see how Grobner bases can be 
advantageously used to settle several important questions about solvability, 
number of zeros, and, finally, finding the zeros of a system of polynomials 
F. 

Theorem 4.3.5 (Hilbert's Nullstellensatz) If L is algebraically closed 
and I C A = L\x\, . . ., x n ] is an ideal, then 

I = L[xi, . . . ,x n ] if and only if Z{I) = 0. 

PROOF SKETCH. 

If J = A, then I = (1) and it is easily seen that Z(I) = 0. 

We prove the converse by contradiction. Let M be a maximal ideal 
containing I such that 1 ^ M: 



f V v e V) (yu£U) 



I C M 5 (1). 
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Since A is Noetherian, such an ideal exists. (Show that this is a consequence 
of the ascending chain condition). 

The residue class ring F = A/M has no improper ideal and has a unit; 
thus, it is a field. 

To every polynomial f(x\, . . ., x n ) G A assign an element of the residue 
class ring F = A/M, given by the natural ring homomorphism. Since 
M 7^ A, every element a of L will correspond to the distinct clement 
a = a + M. [Otherwise, if a ^ b (a, b G L) and a = b, then a — b G M; so 
1 = (a - b)(a - b)- 1 would be e M.] 

Let (x\, . . ., x^) € F n be the images of {x\, . . ., x n ) under the natural 
ring homomorphism from A into A/M. Since, the ring operations in A/M 
are naturally induced from the same operations in A, and a € L maps into 
itself, we see that, for every / £ I C M, f(xi, . . ., x^) — 0, i.e., / has a 
zero in F n . 

But F contains the field L (up to an isomorphism) and F arises from 
L through ring adjunction of the residue classes 3^7 of Xj. Since L is alge- 
braically closed, there is an L-homomorphism 



Hence (<j>(xl), . . ., <j>(x^)) G L n is in Z{I) and Z(I) ^ 0. □ 

An immediate corollary of Nullstcllcnsatz is the following: 

Corollary 4.3.6 Let F C L\x\, . . . , x n ], where L is an algebraically closed 
field. Then 



4>:F -> L. 



Thus 




Z(F)=0 iff le(F) 



PROOF. 



2(F) = =► Z((F)) = 

=> (F) = L[x\, . . . , 
16(f). 



(^) 

16(F) A Z(F) ^ 



(3 P G 2(F)) 1(P) = 1 = 
Contradiction, since, in L, 1 ^ 0. □ 
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Definition 4.3.7 (Solvable System of Polynomials) Let F C L[x\, 
. . ., x n ] be a system of polynomials. F is said to be solvable if the sys- 
tem of polynomial equations has a common zero, i.e., 

if Z(F)£Q. □ 



Corollary 4.3.7 Let F C L[xi, . . . , x n ] and G be a Grobner basis for (F). 
Then F is unsolvable if and only if there is a nonzero c € G fl L. 
PROOF. 

(=0 



= 



le(F) 

(3c^0,ce £ 



1^0 



ce G 



c e G n L 



(<=) 



ceGnL,c^o 



cr x c = 1 e (G) 
le(F) Z(F) 



□ 



Another version of Hilbcrt's Nullstellensatz that we will find useful is 
as follows: 

Theorem 4.3.8 Let f € L[xi, . . ., x n ], where L is an algebraically closed 
field. Let F = {/i, . . ., f r } C L[xi, x n ]. Then if f vanishes at all 
common zeros of F , there is some natural number q such that f q G (F). 

i.e.,f€y/W)- 

PROOF. 

(1) / = 0. Then as € (F), there is nothing to prove. 

(2) / 7^ 0. Consider the polynomials /1, . . ., f r , 1 — zf € L[x\, . . ., x n , 
z]. These do not have a common zero, since if P = (£1, ...,£„,£) € A" +1 
is a common zero of /1, . . ., / r , then 

(i-z/)(p) - 1 -£■/(&,...,£„) = 1^0. 

By the first form of this theorem, we know that 

ie(/i,...,/ r ,i-*/) 

=> 1 = .9l/l + ' ' ' + 9r.fr + .9(1 - Zf) 

where 51, . . ., g r , g £ L[xi, . . ., x n , z] 

=> Substitute 1/f for z and the last term disappears: 



Section 4.3 



Some Algebraic Geometry 



145 



i _ 9l t , , 9r f 

l -j^ h + --- + jr> 

where g[, . . ., g' r E L[xi, . . ., x n ] 

f q = 9'{h + ---+9rfr 

where g'{, . . ., g" E L[x\, . . ., x n ] and q = max{gi, . . ., q r } 
=> /« e (F). □ 

That Theorem 4.3.8 implies Theorem 4.3.5 follows from the following 
observation: 

Z{I) = <=> 1 vanishes at every common zero of / 
lei ^ I = (1) = L[x u . . . ,x n ]. 

Application: Solvability 

Solvability(F) 

Input: F = {/i, . . ., f r } C L[x u ■ ■ ■, x„], 
L = An algebraically closed field. 
Output: True, if F has a solution in A n . 

Compute G, the Grobner basis of (F). Output False, if there is a 
nonzero c in G n L; otherwise return True. 

4.3.3 Finite Solvability 

Definition 4.3.8 (Finite Solvability) Let F C L[x\, . . ., x n ] be a sys- 
tem of polynomials. F is said to be finitely solvable if: 

1. F is solvable. 

2. The system of polynomial equations has finitely many zeroes. □ 

We will see that this exactly corresponds to the case when the ideal 
generated by F is a proper zero-dimensional ideal. Also, we will see that 
this corresponds exactly to the case when we can find a set of generators 
of (F), expressible in a strongly triangular form. 

Theorem 4.3.9 Let F C L[x\, . . ., x n ] be a system of polynomial equa- 
tions. Then the following three statements are equivalent: 

1. F is finitely solvable; 

2. (F) is a proper zero- dimensional ideal; and 

3. If G is a Grobner basis of (F) with respect to > , then G can be 

LEX 

expressed in strongly triangular form. 
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PROOF. 
(1^2): 

As F is solvable, (F) ^ (1), i.e., (F) n L = (0). Assume that 

■ • ■ >£l,n) 



,1 ) • • • j £m,n) 

are the finite set of common zeros of F. Define 

f{Xi) (X{ ' ' ' £ro,i)* 

We see that f(xt) is a degree m univariate polynomial in Xi that vanishes 
at all common zeroes of F. Thus 



(3 q > 0) [f( Xl y e (F) 



Thus, 



(F) n Lfc] ^ (0), 

and (F) is zero- dimensional. Also, since (0) £ (F) £1 (1), (F) is also 



proper. 



(2^3): 

Since (F) is a proper zero-dimensional ideal, we have (F) fl L = (0), and 



(vn) [(f)niW ^ (0) 



i.e., 



(Vk) (3/WeiN) [/(a;*) G (F) 
Since G is a Grobner basis of (F), we see that 

Hmono(/(i,)) = xP* G Hcad(G). 
Together with the fact that (G fl F) = 0, we get 



4 < A- 



(3 5l e G) [Hterm( 5 ) = 
Since we have chosen > as our admissible ordering, 

LEX 

Qi G L \x{ , . . . , x n ]\L[xi + l,... 1 Xn\ 

and it follows that for alH (0 < i < n) there exists a g i+ \ <G Gi such that 
<7j + i has a monomial of the form a ■ xf +1 (a e L and d > 0). Thus, G, 
a Grobner basis of (F) with respect to > , can be expressed in strongly 

LEX 

triangular form. 
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(3^1): 

Let I = (F) = (G), be the ideal generated by F. It follows that 

Z{I)=Z{F) = Z(G). 
Thus it suffices to show that I is finitely solvable. 

1. Since (G n L) = I n L = (0), 1 £ I, and I is solvable. 

2. We will prove by induction on i that for alH (0 < i < n), the z th 
elimination ideal, Ii has finitely many zeroes. We recall that by a 
previous theorem 

ii = m 

where Gi = (J ™ =i Gj , and Gi is in strongly triangular form. 

• Base Case: i = n— 1. 

G„_i consists of univariate polynomials in x n . Since G„_i is strongly 
triangular, there is some polynomial p(x n ) in G„_i of maximum de- 
gree rf n . Thus p(x n ) has finitely many zeros (at most d n of them). 
Since we are looking for common zeros of I n -i, and since p(x n ) € 
I n -i, wc see that has finitely many zeros (not more than d n ). 

• Induction Case: i < n — 1. 

By the inductive hypothesis, the (i + l) th elimination ideal Jj+i has 
finitely many zeroes, say D i+2 of them. 

Let II be the projection map defined as follows: 
n : A™" 4 -» A™- 4 " 1 

: (6+1,6+2, • • • ,61) !-» (6+2, • ■ • ,6i)- 

We partition the zero set of the i th elimination ideal Ii, Z(Ji) into 
equivalence classes under the following equivalence relation: P, Q € 

Z{h) 

p~q iff n(P) = n(Q). 

By Theorem 4.3.3, and the inductive hypothesis, the number of equiv- 
alence classes is finite, in fact, less than or equal to D i+2 - Let p(x i+ i, 
x i+2 , ■ ■ ., x n ) € G, be a polynomial containing a monomial of the 
form a • x^ 1 (a e L and > 0) — assume that di + \ takes the 
highest possible value. If 

[P]~ = {Q:II(Q) = <&+2, ■■■,&>} 
is an equivalence class of P, a common zero of Jj, then £ (where 
Q = (6 6+2, • • •, £n) € [P]~) is a zero of the univariate polynomial 
P(xi+i, 6+i, • • •, 60- Thus 

|[P]~| <di+i 

and has finitely many zeros (not more than d i+ i ■ D i+2 ). 
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(-i,+iy 


















f (+1.-D 



Figure 4.1: The zeros of X\x 2 + 1 = and x 2 — 1 = 0. 



The above argument also provides an upper bound on the number of 
zeros of the system of polynomials F, which is 

d\ ■ d 2 ■ ■ ■ d n 

where di is the highest degree of a term of the form xf* of a polynomial in 

Example 4.3.9 Suppose we want to solve the following system of polyno- 
mial equations: 

{xix 2 + 1, ij-^cKM 

The zeros of the system arc (—1, +1) and (+1,-1), as can be seen from 
Figure 4.1. 

Clearly the system is finitely solvable. 

Now if we compute a Grobner basis of the above system with respect 
to > (with x\ > x 2 ), then the resulting system is strongly triangular, as 

LEX LEX 

given below: 

{x 1 +x 2 , xj-1}. 

We solve for x 2 to get x 2 = {+1,-1}. After substituting these values for x 2 
in the first equation, we get the solutions (xi,x 2 ) = {(—1, +1), (+1, — 1)}. 

Application: Finite Solvability 

FiniteSolvability(F) 

Input: F = {/i , . . ., f r } C L[xi, . . ., x n ], 
L = An algebraically closed field. 
Output: True, if F has finitely many solutions in A". 
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Compute G, the Grobner basis of (F) with respect to > . Output 

LEX 

True, if G is solvable, and is in strongly triangular form; False, otherwise. 



4.4 Finding the Zeros 

Now we are ready to gather all the ideas developed here and devise an 
algorithm to find the zeros of a system of polynomials. The algorithm 
works by successively computing the common zeros of the i th elimination 
ideal and then extending these to the common zeros of the (i — l) th ideal. 

The algorithm involves computing the zeros of a univariate polynomial; 
while this is a hard problem for large-degree polynomials, the algorithm as- 
sumes this can be computed by some oracle. An interesting open problem, 
thus, is to study how this algorithm interfaces with various finite-precision 
(numeric) and infinite-precision (symbolic) algorithms available for com- 
puting the zeros of a univariate polynomial. 

However, the main appeal of the following algorithm is that we can 
turn a multivariate problem into a sequence of univariate problems via the 
Grobner basis. 



150 



Solving Systems of Polynomial Equations 



Chapter 4 



FINDZEROS(-F) 

Input: F = {/i, . . . ,f r } C L[xi, . . -,x„], 
L = An algebraically closed field. 
Output: The zeros of F in A n if F is finitely solvable. 

Compute G a Grobner basis of (F) with respect to > ; 

LEX 

if G is not in strongly triangular form then 

return with failure 
end{if }; 

H-.= { 9 eM; 

Pn-i := the GCD of the polynomials in H; 
X„_i := {<£„> : p(£„) = 0}; 

for i := n — 1 down to 1 loop 

Xi-i := 0; 

for all . . ., £„) G Xj loop 

•ff == {fffci. • • •, Cn) : 3 G Gi_i}; 

Pi_i := the GCD of the polynomials in _ff; 

if pi-\ £ L then 

:= Xi-iU 

{<Ci, Ct+i, • • •, Cn) : Pi-i(fi) = 0}; 

end{if }; 
end{loop }; 
end{loop }; 

return (Xo); 
end{FlNDZEROS} □ 



Theorem 4.4.1 Let F = {f\, . . ., f r } C L\x\, . . ., x n \ and L, an alge- 
braically closed field. Assume that there is an effective procedure to compute 
the zeros of a univariate polynomial in L[x]. 

Then in finite number of steps, the algorithm FindZeros computes the 
zeros of F in A™ if F is finitely solvable. 
PROOF. 

The termination of the algorithm easily follows from the fact that there 
are effective procedures to compute a Grobner basis, to check if a system 
is in strongly triangular form, to compute the GCD of a set of univariate 
polynomials and to compute the zeros of a univariate polynomial. 
Assume that F is finitely solvable. We want to show that 



X = Z{F). 
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It is easily seen that every element of Xo is a zero of (G) and thus of 
the polynomials in F: Xq C Z(F). 

To see the converse, we argue by induction on i that for alH (0 < i < n), 

Z((F)i) C X t , 

where (F)i is the i th elimination ideal of F. 

• Base Case: i = n — 1. 

By a previous theorem, we know that (F)„_i = (G„-iUG„) = (G n ), since 
(G n ) — (0), by assumption. Since G n -i C L[x n ], a principal ideal domain, 
(G„-i) = (pn-i), where p„_i is a GCD of G n -i- Thus, if (£„) is a zero of 
G„_i, then (£„) is also a zero of p n -i, and (£„) e X n _i. 

• Induction Case: i < n — 1. 

By the inductive hypothesis, the zeros of (F) i+ i arc in X i+i . Let II be the 
projection map defined as follows: 

II : A™" 4 -» A™- 4 " 1 

: (6+1,6+2, ■ • • ,£n) >-> (6+2, • ■ • ,6i>- 

Let £ i+2 , . . ., 6i> be a zero of (F),. Then 

(6+2,...,U =n((6+i,6+2,-..,U) e c 

Since 

( f, )*= ^u^j . 

we see that (6+i, 6+2, • • •, 61) is a zer0 °f ^i, and (6) is a zero of the set 

H = {g(a; i+ i,6+2, • • - ,£n):0 € GJ. 

But since H C L[x i+1 ], a principal ideal domain, we see that (6+1) is also 
a zero of pi, the GCD of the polynomials in H. 
Thus 

(6+1,6+2, • ■ • , 61) € -^i) 

and we see that 

X Q C Z(F) = Z((F)o) C X , 
and X = Z(F), as we wanted to show. □ 
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Problems 

Problem 4.1 

Prove that if I and J are ideals then Z{I) U Z( J) and Z(I)(lZ(J) are 
algebraic sets and 

z(i)uz(j) = z{ir\J) = z(i-j), 
z(i)nz(j) = z(i + j). 

Problem 4.2 

(i) Let fi, . . ., f r , gi, . . ., g s G L[x\, . . ., x n ] be a finite set of multivari- 
ate polynomials over an algebraically closed field L. Devise a procedure to 
decide if there is a point p G L n such that 

hip) = 0,...,/ r (p) = and 
ffi (p) ^ 0,..., fla (p) ^ 0. 

(ii) Let /i, . . ., f r , gi, . . ., ^ s and h e i[xi, . . ., x n ] be a finite set 
of multivariate polynomials over an algebraically closed field L. Devise a 
procedure to decide if the following statement is true: 

(/i(p)=OA-A/ r (p)=OAji(p) ? 40A-A S ,(p)54o) 
=> ft(p) = . 

Problem 4.3 

Given / G L\x\, . . ., x„] and a finite set of polynomials _F = {/i, . . ., 
.fr} Q L[xi, . . ., x n ] (L = an algebraically closed field), devise an algorithm 
for the radical ideal membership problem, i.e., an algorithm that decides if 

If so, express f q (for an appropriate q G N) as follows: 

f q = hi fi H h h r f r where f t G L[xi, . . . , x n \. 

Problem 4.4 

Consider the following well-known NP-complete problems: 
(i) Satisfiability: Let u\, be a set of Boolean variables that 

can take one of two truth values: true and false. If u is a variable, then u 
and u are literals with the condition that if u holds true, then u holds false 
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and vice versa. A 3-CNF formula is a conjunction of clauses where each 
clause is a disjunction of exactly three literals. For instance: 

(u\ V U3 V Ug) A (U2 V u 3 V U7) A (mJ V«5 V u^) A (uj V u 5 V u§). 

A 3-CNF formula is said to be satisfiable if it is possible to assign to each 
variable a truth value so that the formula evaluates to true (i.e., at least 
one literal in each clause has the value true). 

Satisfiability(C) 

Input: A 3-CNF formula C over a set of variables wi, . . ., u„. 

Output: If C is satisfiable then return True; 
Otherwise, return False. 

(ii) Graph 3-Colorability: Given an undirected graph G = (V,E), 
it is said to be K-colorable, if there is a mapping / : V — > [1..-K] such that 
every pair of adjacent vertices are assigned distinct colors. 

3-Colorability(G) 

Input: An undirected graph G = (V, E). 

Output: If G is 3-colorable then return True; 
Otherwise, return False. 

(iii) Hamiltonian Path: For an undirected graph G = (V, E), a 
simple path in G is a sequence of distinct vertices, (v\, V2, ■ ■ •, Vk), such 
that G E (1 < i < k). A Hamiltonian path in G from s to t 
(s, t e V, s and t are distinct) is a simple path from s to t that includes all 
the vertices of G. 

Hamiltonian(G, s, t) 

Input: An undirected graph G and two distinct vertices s and t. 

Output: If G has a Hamiltonian path from s to i then return True; 
Otherwise, return False. 

Show that in each case, one can find a set of polynomials /1, . . ., f r € 
L[xi, . . ., x m ] [where m, r, max(deg(/i)) are polynomially bounded by 
the input size of the problem instances] such that the problem has an 
affirmative answer if and only if the corresponding system of polynomial 
equations is solvable. 
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Problem 4.5 

One can define inductively a mapping from the class of Boolean formulas 
over {ui, . . ., u n } to the ring Z^[~r^^, . . ., follows: 



Boolean formulas 

FV G 
FAG 



Z 2 [xi, ...,x n ] 

Xi 

$(F) • $(G) + $(.F) + $(G) 
$(F) • $(G) 
$(F) + 1. 



This mapping usually follows from the classical Stone isomorphism lemma 
of logic. 

(i.a) Consider a system of polynomials over Zp: 



G 



x{ + xi 
x\ + x 2 



x, n + X Tl 



1 







(4.1) 



Show that the set of solutions of equation 4.1 in any algebraically closed 
field containing Zj* is simply (Zj*)*. 

(i.b) Let / e Zjz£[^\f , . . ., x n ) be a polynomial that vanishes at all the 
points of (Z^j*. Show that / e (G). 

(ii.a) Consider a truth assignment 



T : {ui, ...,u n } 
and a vector Wf associated with it: 

Vt = (ai , . . . , a„) where Oj = 



{true, false} 

if T(uj) 
if T(ui) 



true 
false 



Show that a Boolean formula evaluates to true (respectively, false) under 
a truth assignment T if and only if $(F)(vt) — 1 (respectively, = 0). 
(ii.b) Show that F = satisfiablc if and only if g (G). 

(ii.e) Show that F = tautology if and only if 1 + $(F) e (G). 



Problem 4.6 

We use the same notations as those of the preceding problem. Let 

F(m, . . . ,u m ) 

be a statement, in which F is a quantifier- free logical formula over the 
variables u\, . . ., u n . 



9 = fv m, . . . ,u„_ m j 
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Let 

NF£(*(F)) 

= /o(^l 5 • • • ; X n — m) "i" ^ ^ fi(%l ) ■ • ■ 7 ^n— m) J*i (^71— m+1 5 • • • ; *^n) 

where e PP(a;„_„ l+ i, . . ., x„). (See Problem 3.9). 
Prove that g holds if and only if 

1 + fa e (j\,f2, ...,xl+xi,xl+x 2 ,---,xl+Xn)- 

Problem 4.7 

If / is a zero-dimensional prime ideal in K[xi, 
I can be generated by n polynomials: I = (g\, . 
. . ., x n ] . 

Problem 4.8 

Consider two planar curves P{x, y) and Q{x, y) in R[V\ y\. Let G be 
a reduced Grobner basis of the following ideal in 6, u, y]: 

(P(a,b),Q(a,b),u-(a-x)(b-y)) 

computed with respect to a pure lexicographic ordering with a > b > u. 

LEX LEX 

Let V(u, x, y) be the polynomial in G n R[~, r\]. 

Assume that V p (u) = V(u 7 x n ,yo) (for some p = (x ,yo) € K K ) is a 
nonzero square- free polynomial in R[~]. 

Show that the number of positive real roots of V p (u) is exactly equal 
to the number of real zeros of (P, Q) lying in the top-right and bottom-left 
orthants centered at p, i.e., 

{(a, b) : (a > x , b > y ) or (a < Xq, b < y )}. 

Problem 4.9 

Let R = K[x\, . . ., x n ] be a polynomial ring over the field K and / 
an ideal of R. Then the equivalence relation = mod / partitions the ring 
R into equivalence classes such that f,g£R belong to the same class if 
/ = g mod /. The equivalence classes of R are called its residue classes 
modulo /. We use the notation R/I to represent the set of residue classes 
of R with respect to /. 

Let / denote the set {g : f = g mod /}. Check that the map / / is 
the natural ring homomorphism of R onto R/I. We sometimes write f + 1 
for /. R/I is called the residue class ring modulo I. 

(i) Prove the following statements: 



. . ., x n ], then show that 
. ., g n ) where g t & K[x u 
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1. R/I is a vector space over K. 

2. Let G be a Grobner basis, and let 

B = {p : p e PP(x!, . . . , x n ) such that p is not a multiple 
of the Hterm of any of the polynomials in G}. 

Then B is a linearly independent (vector space) basis of R/ (G) over 
K. 

(ii) Devise an algorithm to decide if Rj (F) is a finite-dimensional vector 
space for a given set of polynomials F C R. 

(iii) Devise an algorithm to compute the basis of a residue class ring 
R/(F), where F C R and (F) is a zero-dimensional ideal. Also compute 
the "multiplication table" for the computed basis B; \ip~i,p]<E B, then 
the (p~[, pj) th entry of the multiplication table gives a linear representation 
of pi ■ pj in terms of the basis elements in B. 



Problem 4.10 

Let if be a computable field and < be a fixed but arbitrary computable 

A 

admissible ordering on PP(a;i, . . ., x n ). Assume that G C K[x\, . . ., x n ] is 
a finite Grobner basis of (G) with respect to < . Devise an algorithm to 

A 

find 

p e (G) n K[ Xl ] 

where p is a univariate polynomial of minimal degree in x\. Your algorithm 
must not recompute a new Grobner basis. 



Problem 4.11 

Let / C k[xi, X2, . . ., x n ] be a zero-dimensional ideal in a ring of 
multivariate polynomials over an arbitrary field. Let D\, D2, ■ ■ ., D n be a 
sequence of nonnegative numbers such that 

(vKKn) (3/iS/fl k[xi]) [fi ? and dcg(/,) < A 



Show that I has a Grobner basis G with respect to total lexicographic 
ordering such that 



(VgeG) 



deg( 5 ) < J2 A 
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Problem 4.12 

A plane curve C is said to be given by its {polynomial) parametric form 
(p(t), «(*)>, if 

C = {(a,f3) e : (3 r e k) a = i(t) and /? = ii(t) }. 

Similarly, a plane curve C is said to be given by its implicit form f(x, y), 
if 

C = {(a,(3) eK* : V(a,0) =K}. 

Give an algorithm that takes as its input a curve in its polynomial 
parametric form and produces its implicit form. 



Solutions to Selected Problems 

Problem 4.1 

First, we show that Z(7) U Z( J) = Z(IJ) = Z(I n J). 



(1) p e Z(U) a P & z(i) 
=> (v/e/) (Vj£j) //;(./-') = 

= 



(3 / e /) [f(P) + 



=> (V 9 e J 
4 Fe z(j) 



Hence. 



Z(J)U2(J) D Z(7J). 

(2) Since I J C I n J, we have 

z(7j) d z(inj). 

(3) Lastly, we see that 

{in J ci) a (7 n J c J) 

Z(7nJ)DZ(7) A Z(7HJ)DZ(J) 
Z(7n J) D Z(7) U Z(J). 



Hence, 

Z(7) U Z( J) D Z(7J) D Z(7 C\ J) D Z(I) U Z( J), 
thus implying that all of these sets are equal. 
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Next, we show that Z(I) n Z(J) = Z(I + J). 



P e Z(l)nZ(J) 



V / e / 



<s> Nhei + J] 



f(P) = o] A (Vjgj) [fl(P) = 
) [W = o 



Pez(/ + J). 



Problem 4.2 

(i) The problem can be reduced to an instance of solvability of a system 
of polynomial equations by means of the following artifice due to A. Rabi- 
nowitsch (Math. Ann., 102:518). Consider the following set of equations 
in L[xi, . . ., x n , z]\ 

fi = o 

; (4.2) 

fr = 

■■■9s ■ z) = 0. 

Then, we claim that the following system is solvable if and only if the 
system in equation 4.2 is solvable: 

fi = o 

fr = (4.3) 

9i ± 
9 s ¥= 0. 

Assume that p = . . ., £„) e L n is a solution for the system in equa- 
tion 4.3. Let ( = (j)i(p) ■ ■ ■ g s (p)j ■ Such a £ exists, since gi(p) ■ ■ ■ g s (p) ^ 
and we are working in a field L. It is now trivial to see that p' = . . ., 
£„, C) € is a solution for the system in equation 4.2. 

Conversely, if p' = . . ., £„, () € is a solution for the sys- 

tem in equation 4.3, then p — . . ., £„) 6 L™ is a solution for the 
system in equation 4.3. Clearly, fi(p) = 0, . . ., f r (p) — 0. Additionally, 
9i(p) ' ' ' 9s(p) 7^ 0, since, otherwise, (1 — g\ ■ ■ ■ g s ■ z)(p') would have been 1. 
Since, we are working over a field, it follows that gi(p) ^ 0, . . ., g s (p) 7^ 0. 
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Thus it suffices to check that 

1 £ (fi, ■ ■■ ,/r,(l - 9i ■ ■ -9s ■ z))- 



(ii) By using a sequence of logical identities, it is easy to show that the 
formula 

(/i(p) = A • • • A / r (p) = A 9l (p) ? A • • • A g s {p) ? o) 
=> h(p) = (4.4) 
is equivalent to the following: 
(-3pGL n ) 

/i(p) = A • • • A / r (p) = A 9l (p) ? A • • • A g s {p) ? A fc(p) ^ 

Thus formula 4.4 holds true if and only if the following system of equations 
in L[xi, . . ., x„, z] is unsolvable: 



/i 







fr = 

(1 - ffi ' ' -5s ' ft ' z ) = 



is unsolvable, i.e, if 



1 G (/l, • •• ,/r,(l - 5i- • -5s • ft" 



Problem 4.4 

(i) We shall construct a set of polynomials over C[^^t, Xif, ■ ■ -j x„t, 
x„f, 2] such that the system of equations has a solution if and only if the 
given 3-CNF formula is satisfiable. For each variable u\ we introduce two 
variables, x^ and XiF, and the following set of equations: 



x iT - x iT 

x iF ~ x iF 
XiT + X iF - 1 



(4.5) 
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These equations guarantee that XiT and XiF only take the values and 1 
and that if x%t takes the value 1, then XiF must take the value 0, and vice 
versa. 

Now for each literal we define a mapping T where 
T(ui) = x lT and T(u~) = x iF . 
Now, extend the mapping T to a clause C — (u V v! V u") as follows: 

T(u V v! V u") = T(u) + T(u') + T(u"). 
Finally, extend the mapping T to a 3-CNF formula C\ A ■ ■ ■ A C m : 

m 

T(C 1 A---AC m ) = l-zl[T(C i ). (4.6) 

For instance, the polynomial associated with the 3-CNF example in the 
problem will be 

1-z(xit+x 3F +x st ) (X2F+X 3 T+X 7F ) {x4 F +x 5F +x eF ) (x 3F +x 5T +x S f) ■ 

Clearly, if the 3-CNF formula is satisfiable, then the system of equations 
(4.5) and (4.6) have a solution. (If is true then let x iF — 1 and x iF = 0, 
etc., and since for each clause Cj evaluates to true, T(d) ^ and for some 
value of z e C equation 4.6 vanishes.) 

Conversely, if the constructed system of equations has a solution then 
for each such solution, x iT takes a value in {0, 1}; assign m the truth value 
true iff x^ = 1. Such a truth assignment obviously satisfies the 3-CNF. 

The solution uses 2n + 1 variables and 3n + 1 equations each of total 
degree < (m + 1). Note that the equation T{C\ A • • • A C m ) can be reduced 
in polynomial time (using complete-reduction) by the equations {x\ T — 
XiT, x"ip ~ x iF ■ I < i < n} to yield an equation of degree 1 in each 
variable. 

A much simpler solution can be constructed, by considering the follow- 
ing polynomials in C[^~^, • • ■, x n ]. For each Boolean variable Ui 7 associate 
a variable Xi. Define a mapping T over the literals as follows: 

T(ui) = (xi-1) and f(u~)=x l . 

Now, extend the mapping T to a clause C = (uVii'V u") as follows: 

f{u Vn'V u") = f(u) ■ f{u') ■ f{u"). 

For the given 3-CNF formula C\ A ■ ■ ■ A C m , construct the following system 
of equations: 

f(d) = 0, T(C m ) = 0. 
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It is easily seen that the system of equations is solvable if and only if the 
original 3-CNF formula is satisfiable. 

For the example 3-CNF formula, the system of equations is the follow- 
ing: 

{xi - 1) x 3 (x 8 - 1) = 

X2 (X3 - 1) X 7 = 
X4 X5 X 6 = 

^3 (x 5 - 1) Xs = 0. 
This solution uses n variables and m equations each of total degree 3. 

(ii) Again we shall work over the field C and make use of the three cube 
roots of unity: 1, ui and to 2 — these three constants will be used to represent 
the three colors. For each vertex Vi, associate a variable Xi and introduce 
the following equation into the system: 

- 1 - 0. 

This "enforces" that the vertex Vi takes a color in {1, u>, lu 2 }. Now the con- 
dition that each pair of adjacent vertices [vi,Vj] <G E are assigned distinct 
colors can be "enforced" by the following set of equations: 

x 2 + Xi xj + x 2 = 0, where [vi, Vj] e E. 

It is not hard to show that the graph is 3-colorable if and only if the 
constructed system of equations in C[-r~\^, . . ., x„] (n — \V\) has a solution 
inC K . 

The solution uses \V\ variables and \V\ + \E\ equations each of total 
degree at most 3. 

(iii) As before, our construction will be over the algebraically closed 
field, C. Without loss of generality assume that s = v\ and t = v n . It 
suffices to find a bijective map / : V — > [l..n] such that f{v\) = 1, f(v n ) — 
n and such that if / maps two vertices to successive values (i.e., k and 
k + 1), then the vertices must be adjacent. 

For each vertex Vi, associate a variable Xi and additionally, introduce 
two auxiliary variable z and z' . The bijectivity of / can be "enforced" by 
the following sets of equations: 

x\ — 1 = 0, x„ — n = 
(xi — 1) (xi — 2) • • • (xi — n) = for 1 < i < k 

1-z i| (xi-Xj) = 

\<i<j<n 

The last condition (i.e., (V [vi,Vj] £ E) [|/(«») — f(vj)\ ^ 1] ) can be "en- 
forced" by the following set of equations: 

1 + z' — (xi — xj) 2 z' = 0, for all 1 < i < j < n such that [vi, Vj] £ E. 



162 



Solving Systems of Polynomial Equations 



Chapter 4 



The solution uses |V| + 2 variables and 0(|y| 2 ) equations of each of 
total degree at most \V\ 2 . 

Problem 4.5 

(i.a) Note that G is already in strongly triangular form and one can 
apply the algorithm in this chapter to find the zeros (finitely many) of G. 
It is now trivial to see that 

Z(G) = Ui : £ + £i = 0} x •• • x {£„ : + = 0} = (Z 2 )". 



(i.b) Let h = NFg.(/) (see Problem 3.9). Clearly, h vanishes at the 
common zeroes of G if and only if / vanishes at the common zeroes of G. 
Also, since the polynomials of G are all univariate and of degree 2, h is of 
degree < 1 in each variable. 

Now we claim that if h vanishes at all (Z 2 )™ points (common zeroes of 
G), then h must be identically zero. If h is univariate, then this is obvious. 
In order to prove this inductively, assume that the claim has been proven 
for all such /c-variate polynomials and we want to prove for the case when 
h is (k + l)-variate. Without loss of generality assume that the variable x\ 
occurs in h. Thus, we can express h as follows: 

h = xi h'(x 2 , ...,x k ) + h"(x 2 , ■ ■ ■ ,x k ) 

If h ^ 0, then not both h' and h" are identically zero. Thus for some point 

(6, ...,&) e (z 2 ) k , 

h' or h" (by the inductive argument). Case 1: If h" ^ 0, then h 
does not vanish at (0, ■ ■ ■ > Case 2: If h" — and h' ^ 0, then h does 
not vanish at (1,^2, ■ • ■ ,6c)- hi either case, we get a contradiction. Thus 
we conclude that h = 0. 

Since G is a Grobner basis for (G), we conclude that 

/ vanishes at the common zeroes of G 
=> / — >h = 

=* /e(G). 

(ii.a) The proof follows by a simple structural induction on the Boolean 
formulas. We omit the details. 

(ii.b) Clearly / is satisfiable if and only if there is a truth assignment 
for which / evaluates to true. This is equivalent to saying 



(3pe(Z 2 )") [$(F)(p) 



1 
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i.e., &(F) does not vanish at all the common zeros of G. The rest follows 
by the results of the preceding parts, 
(ii.e) 

/ = tautology 

<&(F)(p) = 1, for all p e (Z 2 ) n 
1 + vanishes at all p e (Z 2 ) n 

<s> l + e (G). 



Problem 4.9 

(i) Proof of the statements: 

Lemma 4.4.2 1. R/I is a vector space over K. 
2. Let G be a Grobner basis, and let 

B = {p : p G PP(xi, . . . , x n ) such that p is not a multiple 

of the Hterm of any of the polynomials in G}. (4.7) 

Then B is a linearly independent (vector space) basis of R/{G) over 
K. 

PROOF. 

The first statement is trivial; simply check that 

a(f + g + I) = af + bg + I, 

(a + b)(f + I) = af + bf + I, 

(ab)(f + I) = a(bf + I), 

!(/ + /) = / + /, 

where a, b <G K and f,g& K\x\, . . ., x n ]. 

Let f = f + (G) be an element of R/ (G) and let 

NF£j(/) = ci-pi + ---+q-pi ) 

where a e K and pi e PP(xi, . . ., x n ). Hence, for all 1 < i < I, pi is not 
a multiple of the Hterm of any polynomial in G, and pi e B. Since we can 
write / as 

7 = ci • pi -\ h ci ■ pi, 

B spans the vector space R/(G). 

Furthermore, the elements of B are linearly independent. Assume that 
for some pi, . . ., pin € B, we have 

ci -pi-\ \- c m -piU = 0, where ci E K. 

In other words, 

/ = ci - pi H h c m -p m e (G). 

Hence NF£(/) = / = 0, i.e., Cl = • • • = c m = 0. □ 
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(ii) Finite-Dimensionality of a Residue Class Ring 

FiniteDimensionality(F) 
Input: FCR. 

Output: True, if R/(F) finite-dimensional. 

Compute G, the Grobner basis of (F). Output true if for alH (1 < i < 
n), a power product of the form xf (ji > 0) occurs among the Hterms of 
the polynomials in G; otherwise, false [i.e., (F) is zero-dimensional]. To 
see that this works, note that if for some i, none of the power products of 
Hterms in G has the required form, then we get an infinity number of basis 
elements as Xi, xf, xf, . . ., etc., are not reducible. The converse is simple. 

(iii) Basis of a Residue Class Ring 

BasisComputation(F) 
Input: FCR. 

Assume that R/(F) is a finite-dimensional vector space. 

Output: (1) A basis B of the vector space R/(F); 

(2) The "multiplication table" for R/(F): 

the (pi, pj) th entry of the table gives a linear representation 
of pi ■ pj in terms of the basis elements in B. 

Compute G, the Grobner basis of (F). Let the set B be as in equa- 
tion (4.7). This is easily computed. For each pi, pj e B, compute the 
normal form of pi ■ pj : 

NF£j(p, ■p j )=c l -p l -\ h c m ■ p m . 

Then the (pi, pj) th entry of the multiplication table is 

ci • Pi H h c m ■ p^. 

The correctness of the procedure follows immediately from Lemma 4.4.2. 



Problem 4.11 

We are told that there is a system of nontrivial univariate polynomials 

/i(xi) = ai^xf 1 + ai^xf 1 ^ 1 H h a Ml , 

f 2 (x 2 ) = a 2 fiX^ 2 + a 2 .\x d ^ 1 H h a 2 ,d 2 , 

fn(x n ) = cinfixf? + a n ^x d ^ x H h a rlydn 

in the ideal / C k[x\, . . . , x n ], where (Vi) [di < Dj], and D\, . . . , D n is a 
fixed sequence of nonnegative integers. 
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Let G be a minimal Grobner basis for I (in the sense of Problem 3.2) 
with respect to the total lexicographic ordering where x\ > X2 > ■ ■ ■ > x n 

LEX LEX LEX 

We may assume that distinct elements of G have distinct head terms, since 
if not, throw out one or the other without altering Head(G). Then for all 
g e G satisfying the following conditions 

Hterm(g) 7^ Hterm(/i), 
Htcrm(g) 7^ fitcrm(/ 2 ), 



Hterm(g) 7^ Htcrm(/„), 

we know that Hterm(/j) (1 < i < n) does not divide Hterm(<7), since e /. 
Thus, 



(v.geG) 



deg(Htcrm( 3 )) < max - 1), D u . . . , D n 

n 

< 



But, since the Grobner basis is computed with respect to a total lexico- 
graphic ordering, each monomial of each jeG has a total degree smaller 
than the total degree of the headterm, and 



(v.geG) 



deg( 5 )<]TA 



i=i 



Bibliographic Notes 

The main algorithm presented in this chapter is due to Bruno Buchberger and 
has also been discussed and analyzed in many survey papers. The discussion in 
this chapter is largely based on the survey paper by Mishra and Yap [149]. Also 
see [16]. [32, 33]. For some related developments and complexity questions, also 
consult [38,39,61,123-125,130] 

A key to settling many important complexity question in computational alge- 
braic geometry has been an effective version of Hilbert's Nullstellensatz, due to 
Brownawell [29] and Kollar [118]. We state this result without proof: 

Effective Hilbert's Nullstellensatz. 

Let I = (/1, . . ., f r ) be an ideal in K[xi, . . ., x n ], where K is an 
arbitrary field, and deg(/i) < d, 1 < i < r. Let h £ K[xi, . . ., x n ] 
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be a polynomial with deg(h) = S. Then h vanishes at the common 
zeros of /, i.e., 

h € v7 

if and only if 

s 

where q < 2(d + l) n and deg(6;/i) < 2(8 + l)(d + l) n , 1 < i < r. 

There are several textbooks devoted solely to algebraic geometry. The fol- 
lowing is a small sample: Artin [7], Hartshorne [89], Kunz [121], Mumford [156], 
van der Waerden [204] and Zariski and Samuel [216]. 

Problem 4.4 was influenced by Bayer's dissertation [16]. Problem 4.5 is based 
on the results of Chazarain et. al. [45] and Kapur & Narendran [114]. Problem 4.8 
is based on Milne's work [146]; also, consult Pedersen's dissertation for some 
related ideas [162]. Problem 4.9 is based on Buchberger's algorithm [33]. 



3 bi, . . . ,b s e K[xi,. ..,x n ] 
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Characteristic Sets 



5.1 Introduction 

The concept of a characteristic sets was discovered in the late forties by 
J.F. Ritt (see his now classic book Differential Algebra [174]) in an effort to 
extend some of the constructive algebraic methods to differential algebra. 
However, the concept languished in near oblivion until the seventies when 
the Chinese mathematician Wu Wen-Tsiin [209-211] realized its power in 
the case where Ritt's techniques are specialized to commutative algebra. 
In particular, he exhibited its effectiveness (largely through empirical evi- 
dence) as a powerful tool for mechanical geometric theorem proving. This 
proved to be a turning point; a renewed interest in the subject has con- 
tributed to a better understanding of the power of Ritt's techniques in 
effectively solving many algebraic and algebraico-geometric problems. 

For a system of algebraic equations, its characteristic set is a certain 
effectively constructible triangular set of equations that preserves many of 
the interesting geometric properties of the original system [47,76-78,173] 
However, while it shares certain similarities with the triangular forms of 
the previous chapter, it is smaller in size but fails to retain certain algebraic 
properties. In particular, a characteristic set of a system of polynomials 
{F} is not a basis of the ideal (F). But because of their power in geometric 
settings, characteristic sets do provide an alternative and relatively efficient 
method for solving problems in algebraic geometry. 

Recently, the constructivity of Ritt's characteristic set has been ex- 
plicitly demonstrated [78]. The original Wu-Ritt process, first devised by 
Ritt [174], subsequently modified by Wu [209-211] and widely implemented 
[47], computes only an extended characteristic set. Furthermore, the Wu- 
Ritt process, as it is, has a worst-case time complexity which can only be 
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expressed as a nonelementary 1 function of the input size and thus, in prin- 
ciple, is infcasible. This difficulty has been alleviated by some of the recent 
algorithms devised by Gallo and Mishra [76, 77]. 

In this chapter, we begin by discussing the pseudodivision process. 
Next, we shall introduce the concept of a characteristic set, followed by 
a survey of the original Wu-Ritt process, and its applications in geometric 
theorem proving. We shall also sketch some of the key ideas that have led 
to recent efficient algorithms for computing a characteristic set. 

5.2 Pseudodivision and 

Successive Pseudodivision 

Let S be a commutative ring. As a convention, we define deg(O) = — oo. 

Theorem 5.2.1 (Pseudodivision) Let f(x) and g(x) ^ be two poly- 
nomials in S[x] of respective degrees n and to; 

f(x) = b n x n + ■ ■ ■ + b , 
g(x) = a m x m H ha - 

Let 5 = max(m — n + 1,0). Then there exist polynomials q(x) and r{x) in 
S[x] such that 

b i9( x ) = q( x ).f( x ) + r(x) and deg(r) < deg(/). 

Moreover, if b n is not a zero divisor in S, then q(x) and r{x) are unique. 
PROOF. 

We can show the existence of the polynomials q(x) and r(x) by induction 
on to: 

• Base Case: m < n 

Take q(x) — and r{x) — g{x). 

• Induction Case: m>n 
The polynomial 

g(x) = b n ■ g(x) - a m x m - n ■ f(x) 

has degree at most (to — 1). By the inductive hypothesis, there exist poly- 
nomials q(x) and r(x) such that 

b™~ n (b n - g{x) - a m x m - n ■ f{x)) 

= q(x) ■ f(x) +r(x) and deg(f) < deg(/). 

1 For a discussion of the nonelementary computational problems, see pp. 419-423 of 
the algorithms text by Aho et al. [3]. Roughly, a problem is said to have a nonelementary 
complexity, if its complexity cannot be bounded by a function that involves only a fixed 
number of iterations of the exponential function. 
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Taking q{x) = a m b™ 11 x m n + q(x) and r(x) = r(x), 

b S n g(x) = q(x)f(x)+r(x) and deg(r) < deg(/). 
Proof of Uniqueness: 

Suppose that b n is not a zero divisor in S and we have 

b 5 n g(x) = q{x) ■ f(x) + r(x) and dcg(r) < deg(/) 
= q(x) ■ f(x)+r(x) and deg(f) < deg(/). 

If q(x) — q(x) ^ 0, then [q(x) — q(x)]f(x) ^ and has degree at least n, 
since b n is not a zero divisor. However, this is impossible since 

deg(r — r ) < deg(/) and {q{x) — q(x))f(x) = r(x) — r(x). 

Thus q(x) — q(x) = = r(x) — r(x). □ 

Definition 5.2.1 (Pseudodivision) For any two polynomials g(x) and 
f(x) ^ in S[x], we shall call polynomials q(x) and r(x) in S[x] the 
pseudoquotient and the pseudoremainder, respectively, of g(x) with respect 
to f(x) [denoted PQuotient(g, /) and PRemainder(t/, /)] if 

b s n g(x) = q(x) ■ f(x) + r(x) and dcg(r) < n, 

where m = deg(g), n = dcg(/), b n = Hcoef(/) and S = max(m — n + 1, 0). 
Also, if g = PRemainder(g, /), then g(x) is said to be reduced with respect 
to f(x). □ 

Remark 5.2.2 If S is an integral domain, the pseudoquotient and the 
pseudoremainder of any pair of polynomials in S[x] are unique. □ 

The above argument leads to a recursive algorithm for pseudodivision: 

PSEUDODlVISIONREC(p(a;), f(x)) 

Input: f(x) = b n x n + ■ ■ ■ + b + £ S[x], 
g(x) = a m x m H h a G S[x]. 

Output: q(x) = PQuotient((/, /), and r(x) = PRemainder((?, /). 

if m < n then 

(q(x),r(x)) := (Q,g(x)); 

else 

(q(x),r(x)) := 

PSEUDoDiviSiONREC(&n5(a;) - a m x m n f(x), f(x)); 
q(x) := a m b™- n x m - n + q(x) " 
end{if}; 
end{PsEUDoDivisiONREc} □ 
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To derive an iterative algorithm, we introduce an integer variable k 
and develop a loop that decrements k from m to (n — 1), maintaining the 
following assertion invariant: 

b S n g(x)= q (x)-f(x)+b k n - n+1 -r(x). 

Let us represent q(x) and r(x) by their respective coefficient vectors 

q ) and (r m ,...,r ). 

PsEUDoDivisiONlT(p(a;), f(x)) 

Input: f(x)=b n x n + --- + b =£0€S[x], 
g(x) = a m x m H h a G 5 [a;]. 

Output: q(x) = PQuotient((?, /), and r(x) = PRemaindcr(<7, /). 

for i := to m — n loop := 0; 
for i := to m loop r» := o»; 
for k := m down to ri loop 

gfc-n := r k ■ b„~ n ; 

for j := 1 to n loop 

for j := to k — n — 1 loop 

r i :— b n ■ Tj\ 

end{loop }; 

end{PSEUDODlVISIONlT} □ 

The notion of a pseudodivision can be suitably generalized so that given 
two nonzero polynomials f(x) e S[x] and g(x, y\, . . ., y m ) £ S[x, yi, . . ., 
y m ], we can determine two polynomials q(x, y\, . . ., y m ) [pseudoquotient] 
and r(x, y\, . . ., y m ) [pseudoremainder] such that 

b 5 n g(x,y) = q(x,y) ■ f(x)+r(x,y), and deg x (r) < deg(/), 

where deg^, denotes the maximal degree of a: in a polynomial containing 
the variable x and b n = Hcoef(/) and 5 = ma,x(deg x (g) — deg x (/) + 1, 
0). In order to give prominence to the fact that the pseudodivision was 
performed with respect to the variable x, we may sometime write 

q(x,y) = PQuotient^, /, x) and r(x,y) — PRemainder(g, /, x). 

Theorem 5.2.2 (Successive Pseudodivision) Consider the following tri- 
angular form 

• • • ,Ud,Xl) 

fi{ui, . . .,u d ,x 1 ,x 2 ) 



fr{ u l: • ■ • 7 Xl: • ■ • 7 ^V) 
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and polynomial g = gi(ui, . . ., ua, xi, . . ., x r ) all in the ring S[ui, . . ., 
Ud, x\, x r }- Let the following sequence of polynomials be obtained by 
successive pseudodivisions: 

r r = g 
r r _i = PRcmainder(r r , f r , x r ) 
r r - 2 = PRemainder(r r _i, f r -i, x r -i) 

r = PRemainder(n, f\,x\). 

The polynomial tq G S[ui, . . ., Ud, x\, . . ., x r ] is said to be the generalized 
pseudoremainder of g with respect to f\, . . ., f r and denoted 

r = PRemainder (5, {/1, . . . , / r }) . 

We also say g is reduced with respect to fx, . . ., f r if 

g = PRemainder (g, {f u . . . , / r }) . 

Furthermore, there are nonnegative integers Si, . . ., S r and polynomials qi, 
. . ., q r such that 

1. b 5 /...b 5 ^g = qi ■/! + ■■■ + q r -f r + r , 

where bi = Hcoef(/i) £ S[ui, . . . , Ud], 

b r = Hcoef(/ r ) £ S[ui, . . . ,Ud,xi, . . . ,x r -i\- 
deg x .(r ) < deg x .(/i), for i = 1, . . ., r. 

PROOF. 

The proof is by induction on r and by repeated applications of the pseu- 
dodivision theorem given in the beginning of the section. □ 

5.3 Characteristic Sets 

Let K[xi, . . ., x n ] denote the ring of polynomials in n variables, with 
coefficients in a field K . Consider a fixed ordering on the set of variables; 
without loss of generality, we may assume that the given ordering is the 
following: 

xi -< x 2 -< ■ ■ ■ -< x n . 
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Definition 5.3.1 (Class and Class Degree) Let / e K[x\, . . ., x n ] be 
a multivariate polynomial with coefficients in K. A variable Xj is said to 
be effectively present in f if some monomial in / with nonzero coefficient 
contains a (strictly) positive power of Xj. 

For 1 < j < n, degree of f with respect to Xj, deg x (/), is defined to be 
the maximum degree of the variable Xj in /. 

The class and the class degree (Cdcg) of a polynomial / G K[x\, . . ., 
x n ] with respect to a given ordering is defined as follows: 

1. If no variable Xj is effectively present in f, (i.e., / G K), then, by 
convention, 

Class(/) = and Cdeg(/) = 0. 

2. Otherwise, if Xj is effectively present in f, and no Xi >- Xj is effec- 
tively present in f (i.e., / £ K[x\, . . ., Xj] \ K\x\, . . ., Xj-\}), then 

Class(/) - j and Cdeg(/) = deg x .(/). 

Thus, with each polynomial / € K[x±, . . ., x n ], we can associate a pair of 
integers, its type: 

Type : K[x\,. ..,#„]—> N x N 

: /^(Class(/),Cdeg(/)). □ 

Definition 5.3.2 (Ordering on the Polynomials) Given two polyno- 
mials /i and /2 G K[x\, . . ., x n ], we say f\ is of lower rank than / 2 , 

h < K 

if cither 

1. Class(/i) < Class(/ 2 ), or 

2. Class(/i) = Class(/ 2 ) and Cdeg(/i) < Cdeg(/ 2 ). 

This is equivalent to saying that the polynomials are ordered according to 
the lexicographic order on their types: 

h ~< f2 iff Type(/i) < Type(/ 2 ). 

LEX 

Note that there are distinct polynomials /i and / 2 that are not comparable 
under the preceding order. In this case, Type(/i) = Type(/ 2 ), and fi and 
/ 2 are said to be of the same rank, f\ ~ / 2 . □ 

Thus, a polynomial / of class j and class degree d can be written as 

/ = I d {xi, . . .,Xj-l)xj 

+ 7 d _i(a;i,...,a; : ,-_i)a^~ 1 H h Io(xi, ■ ■ ■ , Zj-i), (5.1) 

where Ii(xi, . . ., Xj-\) e K[x\, . . ., Xj-i] (I = 0, 1, . . ., d). 
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Definition 5.3.3 (Initial Polynomial) Given a polynomial / of class j 
and class degree d, its initial polynomial, In(/), is defined to be the poly- 
nomial Id{xi, . . ., Xj-i) as in equation (5.1). □ 

As a special case of Theorem 5.2.1, we get the following: 

Corollary 5.3.1 (Pseudodivision Lemma) Consider two polynomials 
f and g G K[x\, ... x n ], with Class(/) = j. Then using the pseudodi- 
vision process, we can write 

H.f) a 9 = Q.f + r, (5.2) 
where deg x . (r) < deg x . (/) and a < deg x . (g) - deg x . (/) + 1. □ 

If a is assumed to be the smallest possible power satisfying equa- 
tion (5.2), then the pseudoquotient and the pseudoremainder are unique. 
Also, polynomial g is reduced with respect to / if g = PRcmainder(g, /). 

Definition 5.3.4 (Ascending Set) A sequence of polynomials T — {fx, 
fi, • • fr) Q K\x\, ■ ■ ., x n ] is said to be an ascending set (or chain), if one 
of the following two conditions holds: 

1. r = 1 and /i is not identically zero; 

2. r > 1, and < Class(/i) < Class(/ 2 ) < • • • < Class(/ r ) < n, 

and each fi is reduced with respect to the preceding polynomials, 
/i's (1 < j < i). 

Every ascending set is finite and has at most n elements. The dimension 
of an ascending set T — {fx, f2, • • •, fr), dim J 7 , is defined to be (n — r). 
□ 

Thus, with each ascending set T we can associate an {n + l)-vector, its 
type, 

Type: Family of ascending sets (NU {oo})™ +1 , 

where oo is assumed to be greater than any integer. For all < i < n, the 
i th component of the vector is 



Type OF) [i] 



Cdcg( 3 ), if (3. 9 e T) [Class(.g) = 
oo, otherwise. 



Definition 5.3.5 (Ordering on the Ascending Sets) Given two ascend- 
ing sets 

F = ih, ■ ■ ■ , fr) and G = { gi , . . . , g s ) , 

we say T is of lower rank than Q , T < Q ' , if one of the following two 
conditions is satisfied, 
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1. There exists an index i < min{r, s} such that 



(V 1 < j < i) [fj 



9j 



and 



fi < 9i 



2. r > s and (V 1 < j < .sj fj 



9j 



Note that there are distinct ascending sets T and Q that are not comparable 
under the preceding order. In this case r = s, and (VI < j < s) [fj <~ gj] , 
and F and Q are said to be of the same rank, T ~ Q. □ 



Hence. 



T < Q iff Type [T) < Type (g) . 



Thus the map, ij/pe, is a partially ordered homomorphism from the family 
of ascending sets to (MU{oo}) K+F , where (NU {oo}) K+if is ordered by the 
lexicographic order. Hence, the family of ascending sets endowed with the 
ordering "-<" is a well-ordered set. 

Definition 5.3.6 (Characteristic Set) Let / be an ideal in K[xi, . . ., 
x n ). Consider the family of all ascending sets, each of whose components 
is in /, 

is an ascending set and f t e /, 1 < i < r|. 

A minimal element in Sj (with respect to the ~< order on ascending sets) 
is said to be a characteristic set of the ideal I. □ 

We remark that if Q is a characteristic set of /, then 

n > \G\ > n-diml. 

Since Q is an ascending set, by definition, n > \Q\. The other inequality 
can be shown as follows: Consider some arbitrary ordering of variables 
and assume that \Q\ = k and the class variables are Vk, • ■ ■, v\. Let the 
remaining variables be called u\, . . ., ui. We claim that u's must all be 
independent. 

Ir\K[ Ul ,...,ui\ = (0). 

Then n — k = I < dim / and \Q\ = k > n — dim/. 

The proof of the claim is by contradiction: Suppose that the claim is 
false, i.e., 

(3 «0^o) [f einK[ Ul ,..., Ul ] . 

Assume that f(u\, . . . ,ui) is of class Uj. Also, / is reduced with respect 
to those polynomials of Q with lower ranks. Then one can add / to Q to 
get an ascending set of lower rank, which is impossible, by the definition 
of characteristic set. 
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Also, observe that, for a given ordering of the variables, the characteris- 
tic set of an ideal is not necessarily unique. However, any two characteristic 
sets of an ideal must be of the same rank. 

Corollary 5.3.2 (Successive Pseudodivision Lemma) Consider an as- 
cending set T = (fi, fi, ■ fr) Q K[x\, x n ], and a polynomial 
g e K\x\, x n }. Then using the successive pseudodivision (see Theo- 
rem 5.2.2), we can find a sequence of polynomials (called a pseudorcmain- 
der chain), go, g\, . . ., g r — g, such that for each 1 < i < r, the following 
equation holds, 



(3^) (la,) [ln(f i ) a *g i = q' i f i +g i _i 



where <?i_i is reduced with respect to f and on assumes the smallest possi- 
ble power, achievable. Thus, the pseudoremainder chain is uniquely deter- 
mined. Moreover, each gi-i is reduced with respect to fi, f+i, . . ., f r . 

r 

In(/ r )^In(/ r _ 1 ) a - 1 • • • In(A) Ql 5 = £ q t f + g . (5.3) 

i=l 

The polynomial go € K\x\, x n ] is said to be the (generalized) pseu- 
doremainder of g with respect to the ascending set T , 

go = PRemainder (g, T) . 

By the earlier observations, go is uniquely determined, and reduced with 
respect to f\, fi, . . ., f r . We say a polynomial g is reduced with respect 
to an ascending set T if g — PRemainder (g, T) . □ 

For an ascending set T ', we describe the set of all polynomials that are 
pseudodivisible by T ', by the following notation: 

M(F) = I g e K[xi, . . . , x n ] : P Remainder^, J 7 ) = j. 

Theorem 5.3.3 Let I be an ideal in K[x\, . . ., x n \. Then the ascending 
set Q = (gi, . . ., g r ) is a characteristic set of I if and only if 



(V / € /) PRemainder(/, Q) 







PROOF. 

Suppose Q is a characteristic set of /, but that there is a nonzero polynomial 
h e / reduced with respect to Q, i.e., PRemainder(/i, Q) = h 7^ 0. If 
Class(/i) < Class(gi), then (h) G Si is an ascending set lower than Q, a 
contradiction. If on the other hand, 

< Class(gi) < • • • < Class(#,) < Class (/i) 

[and Class(/i) < min(Class(<7j + i), n)], then Q' — (gi, . . ., gj, h) <G Si is an 
ascending set lower than Q: 
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1. If j = r, then Q' is a longer sequence than Q, and Q is a prefix of C?'. 

2. If j < r and Class(/i) < Class^j+i), then there is nothing to show. 

3. Finally, if j < r and Class(/i) = Class(<7j+i), then, since h is reduced, 
we have Cdeg(/i) < Cdeg(<7j+i) thus showing that T <Q. 

This also contradicts our initial assumption that Q is a characteristic set 
of/. 

Conversely, suppose £/ is an ascending set but not a characteristic set 
of I and that every h £ I reduces with respect to Q. By assumption, there 
is another ascending set Q' e Si that is lower than Q. Let g\ ^ be the 
leftmost entry of Q' not occurring in Q. By definition, then, g' ;i G / is a 
polynomial with P Remainder^, Q) = g\ ^ — which leads to the desired 
contradiction. □ 

One interesting implication of this theorem is that if g £ Q, a charac- 
teristic set of /, then g e I but ln(g) £ I. Simply observe that 

PRemainder(In( ff ),£0 = Md) + 0. 



5.4 Properties of Characteristic Sets 



The main properties of characteristic sets are summarized in the next the- 
orem. As in the previous chapter, for any set of polynomials F = {/i, . . ., 
fr} Q L[x\, . . ., x n ], (L = and algebraically closed field) 2 we write 2(F), 
to denote its zero set: 

Z(F) = {(&,...,&) £L": (V/Sf) [m,...,^n) = 0] }. 



By F°° , we shall denote the set of all finite products of 
In(A),...,In(/ r ), 
and, for any ideal I in L[xi, . . ., x n ], we use the notation /: F°° for 

[hfel] }. 

It is easily seen that /: F°° is itself an ideal. Note that, for an ascending 
set T = (/i, . . ., f r ), 

PRemainder(/i, T) = => he {T):T°°, 



2 AW the geometric arguments in this chapter do work out, even when we consider a 
ring of polynomials over an arbitrary field K and consider the zeros over any algebraic 
extension of K . However, for the sake of simplicity, we shall work over an algebraically 
closed field. 
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since P Remainder (ft,, T) — implies that we can write 

In(/ r )^ • • • In(/!) ai h = qi h + ■ ■ ■ + dr fr, 

for some nonnegative integers cti, . . ., a r and polynomials qi, . . ., q r e L[xi, 
. . . , x n ] . 

Theorem 5.4.1 Let I be an ideal in L[x\, . . ., x n ] generated by F = {fi, 
. . ., f s }. Let Q — (gi, . . ., g r ) be a characteristic set of I, and let J = [g\, 
■ ■ ■; 9r) C I be the ideal generated by the elements of Q. Then 

1. JC/C J:g°°. 

2- Z(G) \ (U=i -Z(ln(5i))) C Z(I) C Z{Q). 
3. I = prime ideal I n £°° = and I = J: g°° . 

PROOF. 

First, make the following two observations: 

1. J = (gi, . . . , g r ) C 7 [ because gi & L by definition]. 

2. In(gi) ^ 7 [ because P Remaindering ), = In(^j) ^ 0]. 
Assertion (1) follows from the first observation and the fact that 

V / e /) PRemainder(/, 0) = ; 

thus, for some element In(g r ) a *Tn(g r _i) Q!r - 1 • • • In(gi) Ql = g G 

r 

9 f = Y1 n sn e J. 

i=l 

Assertion (2) follows from the previous assertion and the elementary 
properties of the zero sets of polynomials. Thus, 

Z(I) C Z(J) = Z(G). 

Hence, it suffices to prove that 



Z(G) \ l(jz(ln(g t ))\ C Z{I). 



Let P G 7 n be a point in the zero set Z(Q) \ ^Ui=i Z (ln(gi)j^j . Let 

/ e I. Then there exists aj = J]i=i MsO"' € sucn tnat 9.f £ ^ 
Thus, g(P) /(P) = 0. But by assumption, g(P) ^ 0. Then /(P) = 0, as is 
to be shown. 

To see the last assertion, observe that if L is a prime ideal, then we have 
the following: 
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• If some g e belongs to 7, then so does one of the factors of g, 
say In(<7j). But this contradicts the second observation made in the 
beginning of this proof. 

• Consider an / e J:Q°°. By definition, there exists a g e Q°° such 
that 

5/ € J C 7 and g ^ 7. 
From the primality of 7, conclude that / € 7. That is, 7: = 7. D 

The inclusions in assertion (2) of Theorem 5.4.1 can be strict. Consider 
the ideal 7 = (x 2 +y 2 — 1, xy) and suppose x -< y. A possible characteristic 
set for 7 is {x 3 — x, xy} whose zeroes are a set of higher dimension than 
the zeroes of 7. Removing from it the zeroes of the initial of xy, i.e., the 
line of equation y = one gets only two of the four original points in Z{I). 

One way to interpret the preceding theorem is to say that constructing 
characteristic sets helps only in answering geometrical questions, but not 
with general algebraic problems. In particular, we see that characteristic 
sets are not powerful enough to handle the general membership problem for 
an arbitrary ideal. However, as an immediate consequence of the theorem, 
we do have the following: 

PRemainder(/, Q) = <^> f el, 

provided that 7 is prime. 

5.5 Wu-Ritt Process 

Now, let us consider the following triangulation process, due to J.F. Ritt 
and Wu Wen-Tsiin, which computes a so-called extended characteristic set 
of an ideal by repeated applications of the generalized pseudodi vision. His- 
torically, this represents the first effort to effectively construct a triangular 
set corresponding to a system of differential equations. Here, we focus just 
on the algebraic analog. 

Definition 5.5.1 (Ritt's Principle) Let F = {f 1} . . ., f s } C K[xi, . . ., 
x n ] be a finite nonempty set of polynomials, and I = (F) be the ideal 
generated by F. An ascending set Q satisfying either of the following two 
properties is called an extended characteristic set of F. 

1. Q consists of a polynomial in K fl I, or 

2. Q = (gi, . . ., g r ) with Class(gi) > and such that 

9i G I, for all i = 1, . . . , r, 
PRemainder(/j, Q) — 0, for all j = 1, . . . , s. □ 
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The following algorithm, the Wu-Ritt Process, computes an ex- 
tended characteristic set by repeatedly adding the pseudoremainders R 
(obtained by successive pseudodivisions by a partially constructed mini- 
mum ascending chain) to F and then choosing a minimal ascending set in 
the enlarged set R U F. 

Wu-Ritt Process(F) 

Input: F = {/i, f.}CK[x u . . ., x n \. 
Output: Q, an extended characteristic set of F. 

5:=0; R:=<&; 
loop 

F — FUR; F':=F; := 0; 
while F' =£ loop 

Choose a polynomial / G F' of minimal rank; 

F' := F'\{g : Class (ff) = Class(/) and 

g is not reduced with respect to /}; 

end{loop }; 

for all / €F\g loop 

if r := PRemainder(/, Q) ^ then 

R := RU {r}; 
end{if }; 
end{loop }; 
until R = 0; 
return Q; 

end{Wu-RiTT Process}. □ 

It is trivial to see that when the algorithm terminates, it, in fact, returns 
an ascending set Q that satisfies the conditions given in Definition 5.5.1. 
The termination follows from the following observations: 

Let F C K[xi, . . ., x n ] be a (possibly, infinite) set of polynomials. 
Consider the family of all ascending sets, each of whose components is in 
F, 

is an ascending set and e F, 1 < i < r|. 

A minimal element in Sp (with respect to the -< order on ascending sets) 
is denoted as MinASC(F). The following easy proposition can be shown; 
the proof is similar to that of Theorem 5.3.3. 

Proposition 5.5.1 Let F be as above. Let g be a polynomial reduced with 
respect to MinASC(F). Then 

MinASC(F U {g}) -< MinASC(F). 

PROOF. 

Let MinASC(i ;l ) = (/i, . . ., f r ). By assumption, g is reduced with respect 
to F, i.e., 

PRcmainder(g, F) = g ^ 0. 
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If Class(g) < Class(/i), then (g) is an ascending set of F U {g} lower 
than MinASC(F). 

If on the other hand, 

< Class(/i) < • • • < Class(/j) < Class(g) 

[and Class(g) < min(Class(/j + i), ri) ], then (g\, . . ., gj, h) is an ascending 
set of F U {g} lower than MinASC(F). For more details, see the proof of 
Theorem 5.3.3. □ 

Now, let Fi be the set of polynomials obtained at the beginning of the 
i th iteration of the loop (lines 2-15). Starting from the set Fi, the algorithm 
constructs the ascending chain Qi = MinASC(Fi) in the loop (lines 4-9). 
Now, if Ri [constructed by the loop (lines 10-14)] is nonempty, then each 
element of Ri is reduced with respect to Qi. Now, since 

Fi+i = Fi U Ri, 

we observe that 

MinASC(Fo) y MinASC(Fi) > >- MinASC(F l ) > 

Since the ">-" is a well-ordering on the ascending sets, the chain above 
must be finite and the algorithm must terminate. However, it can be 
shown that the number of steps the algorithm may take in the worst case 
can be nonelementary in the parameters n (the number of variables) and 
d (the maximum degree of the polynomials) (see [78]). 

In general, an extended characteristic set of an ideal is not a character- 
istic set of the ideal. However, an extended characteristic set does satisfy 
the following property, in a manner similar to a characteristic set. 

Theorem 5.5.2 Let F C L[x\, . . ., x n ] (L — an algebraically closed field) 
be a basis of an ideal I, with an extended characteristic set Q = (g\, . . ., 
g r ) . Then 

Z{Q) \ (U^In^))^) C Z{I) C Z{9). 

PROOF. 

Let 

M{Q) = {/ : PRemainder(/,£) = o}. 

denote, as before, the set of all polynomials that are pseudodivisible by Q. 
Thus, by the properties of an extended characteristic set, 



(S) c I c (m(G)), 
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since by definition, F C A4(Q). Using the elementary properties of the zero 
sets of polynomials, we get 

z(m(G)) c Z(I) c Z{Q). 

Hence, it suffices to prove that 

Z(Q) \ ({JZ^Mg^ C Z{M(G)). 

Let P e L n be a point in the zero set Z{Q) \ ([J^ =1 Z (ln(gi)j^J . Let 

/ eM(Q). Then there exists ag= nl=i In (ft)" 1 G £°° such that gf e J. 
Thus, ff(P) /(P) = 0. But by assumption, ,g(P) ^ 0. Then /(P) = 0, as is 
to be shown. □ 



5.6 Computation 

Let I C K[xi, . . ., x„] be an ideal generated by a set of s generators, /i, 
. . ., / s , in the ring of polynomials in n variables over the field K. Further, 
assume that each of the polynomial fi in the given set of generators has its 
"total" degree, Tdeg bounded by d: 



Vl<*< 



Tdcg(/ 4 ) < d 



where by Tdeg(/), we denote £V deg Xt (/). Note that deg(/i) < Tdeg(/ 4 ) < 
« de g(/»)- 

Let S = (gi, . . ., g r ) be a characteristic set of the ideal / with respect 
to an ordering of the variables that will satisfy certain conditions to be 
discusses later. 

Our approach will be as follows: first, using an effective version of 
Nullstcllcnsatz 3 , we shall derive a degree bound for a characteristic set of 
a zero-dimensional ideal, and then use a "lifting" procedure to obtain a 
bound for the more general cases. Equipped with these bounds, we can 
exhaustively search a bounded portion of the ring for a characteristic set; 
the search process can be made very efficient by using simple ideas from 
linear algebra. 

Let us begin with the case when our ideal I is zero-dimensional. Note 
that every zero-dimensional ideal / contains a univariate polynomial hj (xj ) , 
in each variable xj, since by definition: 

Ir\K[ Xj ]^(0), for all j = l,...,n. 



3 Originally due to Brownawell [29] and later sharpened by Kollar [118] — see the 
Bibliographic Notes of Chapter 4. 



182 



Characteristic Sets 



Chapter 5 



Since the sequence (hi, . . ., h n ) is clearly an ascending set in Si, we get 
a similar bound on the class degrees of the polynomials in a characteristic 
set of /. Bounds on the total degrees of Q follow. 

1. Clearly Class(<7j) = Class(/ij) = j. 

2. Cdeg( 5j ) < Cdeg(fy) < deg(^). 

3. For all 1 < i < j, deg x .(gj) < deg x .(gi), as gj is reduced with respect 
to the preceding g^s. Thus 

Tdeg( ft ) < Cdeg(ft)^J - j + 1 < n (maxdeg(^)) . 

By combining bounds obtained by a version of Bezout's inequality 4 with 
the effective Nullstellensatz, we can show that max^ deg(/ij) < 2(d + l) 2 ™, 
if Tdeg(/i)'s are all bounded by d. In fact the following stronger theorem 
holds: 

Theorem 5.6.1 (Zero-Dimensional Upper Bound Theorem) Let I - 

(fi, • • •; fs) be a zero- dimensional ideal in K[x\, . . ., x n ], where K is an 
arbitrary field, and Tdeg(/,) < d, 1 < i < s. Then I has a characteristic 
set Q = (gi, . . ., g n ) with respect to the ordering, 

Xl -< X 2 -<•••-< X n , 

where for all 1 < j <n, 

1. Class^) = j. 

2. Tdeg(ft) < 2n(d+ l) 2 ™. 



3. 

(3 a jA , a jtS e K[x\, x n ]j 



9j — a j,ifij 



and Tdeg(a jti fi) < 8n(d + l) 2n , 1 < i < s. □ 

The results on the bounds for a characteristic set of a zero-dimensional 
ideal can be extended to the more general classes of ideals, by a "lifting" 
process used by Gallo and Mishra [77], leading to the following general 
result: 



4 This effective version of Bezout's inequality is due to Hcintz [90] and states the 
following: 

Let I be a zero-dimensional ideal in L[xi, . . ., x„] generated by a set of 
polynomials of degree no larger than d. Then < 2(d + 1)™. 
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Theorem 5.6.2 (General Upper Bound Theorem) Let I = (/i, . . ., 
/ s ) be an ideal in K\x\, x n \, where K is an arbitrary field, and 

Tdeg(/i) < d, 1 < i < s. Assume that x\, xi are the independent 
variables with respect to I. That is, these independent variables form the 
largest subset of {x\, . . ., x n } such that 

InK[x 1 ,...,x l ] + (0). 

Let r = n — dim/ = n — l. Then I has a characteristic set Q = (g\, . . ., 
g r ) with respect to the ordering 

x\ ~< x 2 -<■■■-< x n , 

where for all 1 < j < r, 

1. C\&ss(gj) = j + I. 

2. Tdeg(^) < L>i = 4(s + l)(9r) 2r d(d + l) 4r2 . 
3. 

s 

9j = y ' a j,ifi, ) 

and Tdeg(a jii / i ) < D 2 = ll(s + l)(9r) 2r d(d + l) 4r2 , 1 < i < s. □ 

We are now ready to see how one can compute a characteristic set of 
an ideal, by using the degree bounds of the general upper bound theorem 
and fairly simple ideas from linear algebra. In particular, we shall assume 
that we have available to us effective algorithms for computing the rank 
and determinants of matrices over an arbitrary field. 

Let / be an ideal given by a set of generators {/i, . . ., f s } <G K[x\, . . ., 
x n ], where K is an arbitrary field, Tdeg(/i) < d. Assume that after some 
reordering of the variables, the variables x\, . . ., x n are so arranged that 
the first I of them are independent with respect to /, and the remaining 
(n — I) variables, dependent. 

xi -< x 2 -<■■■-< x n . 

See Chapter 4 for further discussion on how dependent and independent 
variables can be computed. 

Assume, inductively, that the first (j — 1) elements gi, . . ., gj-i, of a 
characteristic set, Q, of / have been computed, and we wish to compute the 
j th element gj of Q. By Theorem 5.6.2, we know that Class(<?i) = (I + 1), 
. . ., Class(gj_i) = (I + j — 1) and Class(gj) = (I + j). Let 



3 a 



e K[xi, 



Cdeg(gi) = d i+1 , Cdeg^-i) = 
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Thus, the polynomial gj sought must be a nonzero polynomial of least 
degree in xi+j, in I n K[xi, . . ., xi+j] such that 



Furthermore, we know, from the general upper bound theorem, that 



and Tdeg(gj), Tdeg(aj j i/,) < D = max(Di, £) 2 ), 1 < % < s. Thus gj 
satisfying all the properties can be determined by solving an appropriate 
system of linear equations. 

Let Mi, M 2 , . . ., M p be an enumeration of all the power products in 
xi, . . ., x n [called, PP(xi, . . ., x n )] of degree less than D; thus p satisfies 
the following bound, 



The enumeration on the power products is assumed to be so chosen that 
the indices A < \i only if one of the following three conditions is satisfied: 



1. M\ e PP(xi, . . ., x l+j ) and M M e PP(xi, . . ., x„)\PP(xi, . . ., x t+j ). 

2. M\, Mp e PP(xi, . . ., xi +j ) and 



deg xi+1 {gi) < d i+1 , deg 



•Xl+j-l 



< di+j-i- 




(5.4) 





and 




3. M\, Mp G PP(xi, xi +j ), 




and deg x;+ .(M A ) < deg x;+ . (M^). 



Let us express djj and gj symbolically as follows: 



p 




l<i<s 



A=i 



p 



9j 
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By equating the the terms of the same monomials on both sides of equa- 
tion (5.4) 



(3 dj-,1, . . .,a jtS e K[xi, . . .,x n ]j 
we get at most 



9 3 ^2" ■ 

i=l 



d + l+j 

l+J 



■ p<(d+l) n p 



equations, in (s + l)p unknowns, a's and (3's. We represent the homoge- 
neous system as follows: 



A\ - ■■ A sp :Bi ■ ■ ■ Bp 



= 0. 



ai 



Any solution of this system that minimizes the index of the last nonzero 
entry of the /3's will correspond to the desired gj. The existence of a 
solution of the desired nature follows from Theorem 5.6.2, and the rest 
follows simply from the choice of the ordering on the power products. (The 
reader may check!) 

Let Ai(= 1), A2, . . ., A; and m, be a sequence of indices such that, for 
1< i < 1, 



< rank A 1 ■ ■ ■ A Xi 
— rank A\ ■ ■ ■ A\ i+1 
< rank A\ ■ ■ ■ A\ i+1 



= • • • = rank 



and 



rank 



Ax---A s 



< rank 



A 1 ---A sp :B 1 



< • • • < rank 



Ai ■ ■ ■ A sp :Bi ■ ■ ■ B 



m—l 



rank 



A\ ■ ■ ■ A sp Bi ■ ■ ■ B n 



Then, we know that in the desired solution all the a's and /3's, not including 
a\ lt . . ., a\ t and f3i, . . ., /3 m , must be zero. Thus, we need to solve the 
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following system of linear equations 



A\i • • • A\ l : B\ ■ ■ ■ B„ 



— B„ 



"A, 
01 

0m-l 

This linear algebraic problem can be easily solved to derive the element, 
gj , a monic polynomial given by: 

gj = M m - $ m _ x M m _ x ftMi. 

Thus the procedure outlined here is completely effective. 



Theorem 5.6.3 Let I = (/i, . . ., f s ) be an ideal in K\x\, . . ., x n ], where 
K is an arbitrary field, and Tdeg(/j) < d, 1 < i < s. Then with respect to 
any ordering on the variables: 



X\ -i, X 2 -<•••-< Xn 



where the first dim(7) -many variables are independent, one can effectively 
compute a characteristic set of I. □ 



5.7 Geometric Theorem Proving 

Let us consider how the concepts of characteristic sets (or extended char- 
acteristic sets) can be used in a simple geometric theorem prover. The 
first such mechanical theorem prover, devised by Wu Wen-Tsiin [209-211] 
in China, has come to be known as the China prover and has been success- 
fully used to prove many classical and some new theorems in plane analytic 
geometry. Some further improvements in this direction has been achieved 
by Shang-Ching Chou [47]. 

Other approaches, based on different constructive methods in computer 
algebra, have also been proposed for this problem: For instance, Kutzler 
and Stiftcr [122], Kapur [111] and Kapur and Narendra [114] have proposed 
methods based on Grobner bases; Carra and Gallo [42] and Gallo [75] 
have devised a method using the dimension of the underlying algebraic 
variety; J.W. Hong [101] has introduced a seminumerical algorithm using 
an interesting gap theorem and "proof-by-example" techniques. 
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The method based on the Wu-Ritt characteristic sets is not, however, a 
real theorem prover, in the sense that it does not follow any logical proof- 
theoretic techniques; it simply takes an algebraic translation of a set of 
geometric statements and tries to verify its validity in the manner to be 
made more precise. For most of the geometric statements, however, the 
translation from geometry to algebra is far from being completely auto- 
mated. The fundamental principle that the translation method relics on is 
fairly simple, i.e., the classical coordinate method of Descartes introduced 
in the 17th century. 

Definition 5.7.1 (Elementary Geometry Statement) By an elemen- 
tary geometry statement, we mean a formula of the following kind: 

(/i = A h = A ••• A f s = 0) (.9 = 0), (5.5) 

where the /j's and g are polynomials in L\x\, . . ., x n ], the variables Xi's 
are assumed to be bound by universal quantification and their ranges are 
assumed to be the field L, the base field of the underlying geometry. We 
further assume that the base field L is algebraically closed. □ 

The conjunct = 0) is called the premise of the geometry state- 

ment, and will be assumed to be nontrivial, i.e., the set of points in L n 
satisfying the premise is nonempty. The statement g = is its conclusion. 
To prove a statement, then, is to show that a geometric formula is valid. 

However, one problem with the above algebraic statement is that it does 
not mention certain geometric degeneracies that are implicitly excluded: 
For instance, when a geometric statement mentions a triangle, it is con- 
ventionally assumed to mean those nondegencrate triangles whose vertices 
are noncollincar. But, on the other hand, spelling out all the conceiv- 
able geometric degeneracies makes the process unappealingly cumbersome. 
Wu's approach permits one to circumvent this problem as it produces these 
nondegencracy conditions as a natural by-product. In fact, Wu's approach 
proves an elementary geometry statement, in the sense that it shows that 
the conclusion is true whenever the premises are generically true. 5 Follow- 
ing Wu, in this case, we will say that the corresponding geometric statement 
is generically true. 

The Wu's algorithm is as follows: Assume that the input to the al- 
gorithm is an elementary geometry formula which is obtained after the 
theorem has been translated algebraically. 



5 In geometric terms, it shows that the conclusion polynomial vanishes on some open 
subset (under the Zariski topology) of the zero set of the system of polynomials, /1, /2, 
fs- 
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Wu's Algorithm(F) 

Input: Premises = F = {/i, . . ., f a } C L[xi, . . ., x„], 

Conclusion = g € L[xi, . . ., Xn \ • 
Output: Trivial, if the premises are contradictory; 

True, if the geometry statement is generically true. 

Compute Q, a characteristic set of (F); 
if Q = (1) then 

return Trivial; 
elsif PRemainder((/, Q) = then 

return True; 

else 

return Unconfirmed; 
end{Wu's Algorithm}. □ 

In order to understand Wu's algorithm we need to make the following 
observations: 

• If Q = (1), then, since every gi € Q also belongs to the ideal (F), 
we see that (F) = (1) and that Z(F) = 0. In this case, the system 
of premises are inconsistent and the geometry statement is trivially 
true. 

Also observe that if the ideal (F) = (1), then, by definition, its 
characteristic set must be (1). Thus, the algorithm always correctly 
detects a trivial geometry statement. 

• Suppose Q — (g\, . . ., g r ) ^ (1). If r = PRemaindcr(<y, Q) = 0, then 

r 

In( 5r ) a "In(5r-i) a '- 1 • • • In( 5 i) Ql ff - ]T q l9l € (/i, . . . , /.). (5.6) 

i=l 



Thus at every point F € L", at which /,'s (hence, g^s) vanish, but not 
In(<7j)'s, we note that the conclusion polynomial g must also vanish. 
The clause 

In( 9l ) = V • • • V In(g r ) = 

has been interpreted by Wu as the associated degeneracy condition 
for the original geometry statement. Thus, when Wu's algorithm 
returns the value "true," the following holds: 

(VPGL") (/i(P) = 0A---A/.(P) = 0)a 

^(ln( 5l )(P) = V • • • V In( 5r )(P) = 0) 

=> g(P) = o 
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Geometrically, we have 

r 

Z{g) D Z(f u ...,f s ) \ \JZ(ln( 9l )). 

i=l 

One problem with Wu's algorithm, as presented here, is that it is not 
complete, and in fact, it is only the first half of the algorithm developed 
by Wu. Nevertheless, just this portion of the algorithm has found many 
applications, as one can argue heuristically that the algorithm succeeds in 
vast majority of the cases [47]. 

Wu's algorithm guarantees that whenever the algorithm claims that a 
statement is generically true, it is indeed so; however, the converse does 
not hold. In fact, we may have a conclusion g that vanishes on some open 
set of the zero set of the premises, but Wu's algorithm may fail to detect 
this without further decomposition of the characteristic set. 

Even the complete Wu's algorithm has several hard-to-eliminate draw- 
backs. First, it is unable to work with arbitrary fields (not algebraically 
closed, e.g., M). If propositions about real geometry are investigated in 
this way, a false proposition will be rejected (because it is false also in the 
complex field), but sometimes a true theorem (over real closed field) may 
unfortunately be rejected. For example, consider the hypothesis x 2 +y 2 = 
and the conclusion y = 0. This is of course true in but false in C^. 

Lastly, the method is unable to handle inequalities. So geometric propo- 
sitions involving "internal" "external ," or "between" are not in the range 
of such a prover. In this sense, the so called "Wu geometry" is smaller 
in scope than the more general "Tarski geometry" which includes all the 
propositions that can be proved by real quantifier elimination algorithms 
[91]. We shall come back to these questions later in the book. 



Problems 

Problem 5.1 

Show that a characteristic set of an ideal is not necessarily its basis. 

Hint: Consider the ideal generated by the set of polynomials F = 
{x 2 +y 2 — l, xy} C Q[^, y]. Show that Q = {x 3 — x, xy} is a characteristic 
set of (F) under the ordering x ~< y. However, (Q) g (F). 

Problem 5.2 

True or False: 

1. I = (1) if and only if its characteristic set is (1). 

2. I = (1) if and only if its extended characteristic set is (1). 
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Problem 5.3 

Show that a characteristic set is an extended characteristic set but not 
the converse. 

Problem 5.4 

Using the characteristic set algorithm, devise solutions for the following 
two problems: 
(i) 

Solvability(F) 

Input: F = {fx, . . ., f r } C L[xi, . . ., x„], 
L — An algebraically closed field. 
Output: True, if F has a solution in L" . 

(ii) 

FiniteSolvability(F) 

Input: F — . . ., f r } C L[xi, . . ., x„], 
L = An algebraically closed field. 
Output: True, if F has finitely many solutions in L n . 

Problem 5.5 

Let H — {hj, h 2l ■ ■ ., h n } C K[x\, ■ ■ ., a; n ] (iif = a field) be a set of 
univariate monic polynomials, one in each variable: 

hj(xj) e K[xj] \ K and In(/ij) = 1. 

Then show that H = (h\, ■ ■ ■, h n ) is a characteristic set of (i?) with 
respect to the following ordering of the variables: 

x\ -< x 2 -<■■■-< x n . 

Problem 5.6 

Let F: K[x\, . . ., x„] K[x±, . . ., x n ] be a if-endomorphism, where F 
may be written as F = (/i, . . ., /„) (where /, £ K[xi, . . ., x n \). F is said 
to be invertiblc if each variable Xi can be expressed as a polynomial of /i , 

• • - j fn- 

Show that that F is invertible with inverse G — (g\, . . ., g n ) if and only 

if 

Q = {Xl ~ gi(Vl, ■ ■ - ,3/n), ■ •• , - 5n(j/l, • • - ,2/n)} 

is a characteristic set of the ideal 

I = (yi- fi(xi, ■ ■ -,x n ), . . . , y n - f n (xi, ■ ■ .,£„)) 
of if [j/i, . . ., y„, xi, . . ., x n ]. 
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Problem 5.7 

Let L be an algebraically closed field, and let I be an ideal in L[x\, . . ., 
x n ], generated by the following n polynomials, F — {/i, . . ., /„}, each of 
degree d: 

fi = xi - x d n , 
f 2 = x 2 - xf, 

fn—l Xn—1 ~ x n— 2> 

In X n X n _^. 

Then show that independent of the ordering on the variables, we have the 
following: 

1. Every characteristic set of F is of degree D > d n . 

2. Every extended characteristic set of F is of degree D > d n . 

Problem 5.8 

Show that the number of pseudodivision steps, T(n, d) that the Wu-Ritt 
process may use in the worst case is nonelementary in the parameters n 
(the number of variables) and d (the maximum degree of the polynomials), 
independent of the ordering on the variables: 

X\ < X2 < ■ ■ ■ < x n . 

In particular, show that 

T(0,d) = 1, 

T(n,d) < T(n- l,c'(l + V2) d ) + d, n > 1, 

leading to 

.d(l + V2) d 

T(n,d)<c-d{l + V2) d( - 1+V ^ y 

v ' 

n 

Problem 5.9 

Consider an ascending set T = (fi, . . ., f r ) C K[xi, . . ., x n ] and a 
polynomial g€ K[xi, ... x n ], where K is an arbitrary field, Tdeg(/,) < d, 
for all 1 < i < s, and Tdcg(g) < S. 

If go = PRemainder((7, T~) is the generalized pseudorcmaindcr of g with 
respect to T then show that 

Tdeg( 50 ) < (d+lY(S + l). 

Also, devise an algorithm that can compute the generalized pseudoremain- 
der g using 0(S°^ (d + l)°(™ r )) arithmetic operations. 
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Problem 5.10 

Consider the following algorithm for geometric theorem proving using 
the Grobner basis algorithm: 

Kapur's Algorithm(F) 

Input: Promises = F = {/i, . . ., / s } C L[xi, . . ., x n ], 

Conclusion = g £ L[xi, . . ., x„\. 
Output: Trivial, if the premises are contradictory; 

True, if the geometry statement is generically true; 
Return the nondegeneracy condition g' — 0. 



Compute Qi, a minimal Grobner basis of (F); 

Compute Q2, a minimal Grobner basis of (F U {gz — 1}); 

Comment: The last computation is done in the ring L[xi, . . ., x„, z\; 

if Si = {1} then 

return Trivial; 
elsif g 2 = {1} then 

return True, nondegeneracy condition g' = 1; 

Comment: No nondegeneracy condition is necessary; 
else for every g» G Q2 n L[xi, . . ., x n ] loop 

if g t (F) and 1 (F U {g t z - 1}) then 

return True, nondegeneracy condition g' = g»; 

end{if }; 



return Unconfirmed; 
end{KAPUR's Algorithm}. □ 



Prove that the above algorithm is correct. 

Note that for a given set of premises F = {/1, . . ., / s } and a conclusion 
g we shall say g' represents a nondegeneracy condition if and only if 

(vPslj (/ 1 (P)=0A---A/ s (P) = 0)a( 5 '(P)^0) => g(P) = 

but not the following: 

(vP6ij (/i(P)=0A-A/,(P) = 0) => g'{P) = 



Solutions to Selected Problems 

Problem 5.2 

(1) True. Since a characteristic set Q of an ideal / is a subset of /, wc 
have 

(1)C(0)CJC(1). 
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The converse is a trivial consequence of the minimality of the characteristic 
set among the ascending chains. 

(2) As earlier, an extended characteristic set Q' is a subset of the ideal 
I, and 1 G Q' implies that I = (1). 

However the converse is not necessarily true. Consider the ideal gen- 
erated by the polynomials: F — {x 2 + 2x + 1, xy 2 + y 2 + x} C Q[V\ y]. 
It is easily seen (applying the Wu-Ritt process, for instance) that F is an 
extended characteristic set of (F). However, (F) = (1), while its extended 
characteristic set docs not contain 1. Note that 

1 = (y i + 2y 2 + l)(x 2 + 2x + l) 

- (xy 2 + y 2 + x + 2)(xy 2 + y 2 + x) 
G (F). 



Problem 5.5 

First, note that H is an extended characteristic set of H (with respect 
to the ordering x\ -< Xi ~< ■ ■ ■ -< x n ); this is easily seen by an application 
of the Wu-Ritt process to the set H. Thus 

H C M(H). 

We claim that the set M (H) is an ideal. Thus 

(H)c(m(H)) =M(H), 

and H is a characteristic set of (H), since 

(V / G (ff)) PRemainder(/, H) = . 

In order to prove the claim, we make the following simple observations: 

1. Since ln(hj) = 1, for all 1 < j < n, 

PRemainder((/, H) — g' A P Remainder^", H) = g ! Q 

(A) PRcmainder( 3 ' + g" , H) = g' + g„ 

(B) PRcmainder(c ff ', H) = cg' Q , (v c e K^. 

2. Since hj e K[xj] \ K, for all 1 < j < n, 

PRcmainder(g', 7i) = => PRcmainder(x i g', 7i) = 0, ^Vi,j. 

Assume that hi — xf + Cd-ixf^ 1 + • • • + c , and let the following 
sequence 

9o(= 0),gi,---,9i-i>---,9n(=9') 
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denote the sequence of pseudoremainders, obtained by dividing g' by 
H. If deg 2 , j g' i _ 1 < (d — 1), then, clearly, 

PRcmainder(x i g', H) = P Remainder (xig 1 , . . . , fti)^ = 0. 

Otherwise, deg^,. = (d — 1), and 

•Ki9i—1 — ^d{x\ , • ■ • , Xi— 1 , X^-j-i ^ ■ • ■ 7 X n )x^ 

+ oy_i(xi, . . . ,Xi_i,x i+ i, . . . ,x„)xf _1 H 

-\- OL\ {x \ , . . . , X{—1 , X^-j-i , • • • , X n )Xj , 

where PRcmainder(afc, . . . , fti)) = 0. But 

PRcmainder(xj^_ 1 , ftj) 

= («d-i - Cd-iaj)^ -1 H h (ai - cia d )xi - c a d , 

and, by (1), PRemainder((a fc - c k a d ), (hi-i, hi)) = 0. 
Hence 

PRemainder(xj</, H) 

= PRemaindcr^PRcmainder(xi5-„ 1 , hi), • ■ • , fti>) =0. 

Thus if g' , g" G A4(H) and / G K[x\, . . ., x„], then, by observation 
(1A), g'+g" G M(H) and, by observations (1) and (2), fg' G M(H). 

A somewhat indirect proof of the statement may be obtained by first 
observing that H is a Grobner basis of the ideal (H) (with respect to any 
admissible ordering), which immediately implies that 

(V g G (H)) (V 1 < j < n) [deg Xj (g) > Cdeg(^) 

Since, Class(/ij) = j, and since it has the minimal possible class degree, 
indeed, H is a characteristic set of (H). 

Problem 5.7 

Let 7r G S n be a permutation of [l..n], and the arbitrary but fixed 
ordering on the variables be the following 

Xi = X 7r (l) -< X 7r(2 ) -<•••-< 

Note that In (£[xj] \ L) contains a nonzero polynomial Xj — xf" of minimal 
possible degree, d n . 
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1. Let Q = (gi, . . . ,g r ) be a characteristic set of F with respect to the 
chosen ordering. Since dim / = 0, 

\Q\=n and g 1 e I n (L[xi\ \ . 

Thus, D > Cdcg(. gi ) = min{dcg(/): / e In [L[ Xi ] \L)}=d n . 

2. Let Q' = (g[, . . . ,g' r ) be an extended characteristic set of F with 
respect to the chosen ordering. By an examination of the Wu-Ritt 
process, we see that In(g'j) = 1, for all 1 < j < r, and 

(Jz(ln(^-)) = and Z(Q') = Z(I) = a finite set. 

Hence 

\G'\=n and g x e I n (l[z;] \ l). 
Thus, D > Cdeg(. 9l ) = min{deg(/): / e I n (IN \ L)| > d". 
Problem 5.10 

Note that the conditions for </ to represent nondegeneracy are equiva- 
lent to the following: 

g 1 £ VW) and gg' e J{F). 
Or in geometric terms: 

Z{g) D Z{F) \ Z(g') ± 0. 

We shall need the following two facts: 

• Fact 1 : 

ge^{F) & le(Fu{ S 2-l}). 

See the proof of Theorem 4.3.8. 

• Fact 2: 

g' £ (Fu{gz-l})r\L[x u ...,x n } A g' (F) 

=> gg'e^{F). 

By assumption, we can express g' as follows: 

.<?' = h 1 f 1 -\ h h s f s + h(gz - 1), 

where h =/= 0. Now after substituting 1/g for z and noting that the last 
term disappears, we see that for some natural number q, we have 

g'g q = ti 1 h + --- + ti s f s e (F). 
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Thus {gg'Y e (F) and gg' e 

Thus it is easy to see that if the algorithm outputs "true" with a non- 
degeneracy condition g' , then g = in fact follows from fi = 0, . . ., f s = 0, 
under the assumption that g 1 ^ 0. 

Now assume that p is a nondegeneracy condition for the theorem. We 
wish to show that the algorithm will produce a nondegeneracy condition 
in this case. By assumption p £ \f(F) but pg e \J (F). Thus, 

p ™ g ™ g (F) - -l)=p ra e(fU {.gz - 1}), 

but by assumption, p m ^ (F). 

But since p m 0, [G = Gr6bncr(F U {.gz - 1})] there is a 5l e (F U 

{.gz — 1}) nL[xi, . . . , x„] but ^ (F); and the algorithm will produce such 
a gi, since not all such g^s can be y/(F). 
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The following computational complexity result [77] for characteristic sets is 
now known: 

Complexity of Characteristic Sets. 

Let I — (/i, . . ., f s ) be an ideal in K[x\, . 
arbitrary field, and Tdeg(/j) < d, 1 < i < s 
any ordering on the indeterminates, 

xi -< X2 <•••-< x„, 

where the first dim(/)-many variables are independent, one can com- 
pute a characteristic set of /, in O (s°( n \d+ l)°( n ^ sequential 
time or O (n 7 log 2 (s + d + 1)) parallel time. The polynomials in the 
computed characteristic set are of degree O (s(d + l)°( n M . 



. ., x n ], where K is an 
. Then with respect to 



Bibliographic Notes 



197 



Although the concept characteristic set has proven to be quite useful in the 
realm of geometry, several extensions are necessary to improve its power: one 
such idea is that of an irreducible characteristic set. 

Irreducible Characteristic Set. 

Let I — (/i, . . ., f s ) be an /-dimensional ideal in K[xi, . . ., x„]. Fur- 
ther, assume that the first I variables, xi, . . ., xi, are the independent 
variables, the last r = (n — I) variables, . . ., x n , the dependent 

variables, and the ordering on the variables is the following: 

xi -<■■■-< xi -< xi+i -<■■■-< x n . 

A characteristic set of J, Q — {gi, 
characteristic set of J, if 

gi — irreducible over K\ 

</2 = irreducible over K2 



gj = irreducible over Kj 



g r = irreducible over K r 

where QF denotes the field of fractions over an integral domain. 

This definition above is constructive, in the sense that the irreducibility of 
an ascending set can be tested algorithmically, since there are factorization algo- 
rithms over a field and over the successive algebraic extensions of a given field. 

The significance of the notion of an irreducible characteristic set becomes 
clear from the following proposition whose proof, using the concept of a generic 
point of an irreducible variety, can be found in [174] or [211]: 

Let / be an ideal, and Q a characteristic set of I. Then 
Q is irreducible I is prime. 

The idea of irreducible characteristic set extends the domain of applicability 
to a larger class of algebraic problems. 

1. Test for Deciding Primality of an Ideal: This can be achieved by us- 
ing the characteristic set algorithm together with the test for irreducibility 
of a univariate polynomial over an arbitrary field. 

2. Test for Deciding Membership in a Prime Ideal: Recall that if I = 
prime and its characteristic set is Q, then 



. . ., g r ), is said to be an irreducible 

= K(xi,. ..,xi) 
= Qf(k 1 [x 1+1 ]/{ 9i ) 



Qf^-iln+i-il/fe-i) 

QJ?^ r -l[l r -l]/(jr-l) 



g € I <^ PRemainder(#, Q) = 0. 
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3. For a given ideal I, by using the factorization algorithm, one can construct 
a sequence of irreducible characteristic sets Q 1 , . . ., Q k such that 



z(i) = z(g 1 )u---uz(g k ). 



Thus, 



9 e 



v7 |Vi 



PRemainder(p, C/i) = 



4. Lastly, using above geometric decomposition, one can construct a complete 
scheme for geometric theorem proving, where one decides if a formula of 
the kind 



V PEL' 



/ 1 (P) = 0A---A/ s (P) = 0jA 
n (ln( fll )(P) = V ■ ■ ■ V In(ff r )(P) = O^j 
* g{P) = 



is "generically true." 
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An Algebraic Interlude 



6.1 Introduction 

Before we move on to the topics of resultants and an algorithmic treatment 
of real algebra, we shall take a short pause to study in this chapter the 
unique factorization domain, the principal ideal domain, and the Euclidean 
domain. Of course, readers familiar with these topics may safely skip this 
chapter and go directly to the next chapter. 



6.2 Unique Factorization Domain 

Divisibility 

Definition 6.2.1 (Divisor) In a commutative ring S (with identity), we 
have the following: 

• A ring element s is said to be a divisor (or factor) of u (denoted s \ u) 



that is, u G (s). If s \ u, we also say u is a multiple of s (or u divisible 



• Let Si, . . ., s r € S be a set of ring elements. We say s is a common 
divisor of Si, . . ., s r if 



In what follows we shall let S denote a commutative ring 
with identity. 



if 




by s). 



S I S ^ 2 ' ' ' 7 '"'I ' 
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Thus 

Si e (s) A • • • A s r e (s) and (si, . . . , s r ) C (s). 

We say s is a maximal common divisor of si, . . ., s r if s is maximal 
among all common divisors of si, . . ., s r , under the partial ordering 
>-, defined as s >~ t if t \ s. Thus a maximal common divisor of s\, 
. . ., s r is the generator of a minimal principal ideal containing the 
ideal (si, . . ., s r ). 

We say s is a common multiple of Si, . . ., s r if 

S ^ I ^ 1 ■ ■ ' 7 | S . 

Thus 

s e (si) A ■ ■ ■ A s e (s r ) and (s) C (si) H • • • H (s r ). 

We say s is a minimal common multiple of si, . . ., s r if s is minimal 
among all common multiples of Si, . . ., s r , under the partial ordering 
>-, defined as s y t if t \ s. Thus a minimal common multiple of si, 
. . ., s r is the generator of a maximal principal ideal contained in the 
ideal (si, . . ., s r ). □ 

If 

u = s ■ t, u, s, t e 5, 

then we say w admits a factorization into factors s and t. 

If e is a unit (invertible element) of S, then every ring element u admits 
a factorization 

u = ue^ 1 ■ e. 

Such factorizations where one factor is a unit is called a trivial factorization] 
ue^ 1 is a trivial factor of w. 

Definition 6.2.2 (Indecomposable Element) A nonunit ring element 
u is said to be an indecomposable element of S if u admits only trivial 
factorization of the kind u = s ■ t where s or t is a unit. □ 

Definition 6.2.3 (Prime Element) A nonunit ring element u is said to 
be a prime element of S if for all s, t £ S 

u | s ■ t => u | s V u | t. □ 

Definition 6.2.4 (Associates) Two ring elements s,(eS are said to be 
associates in S (denoted, smt)ifs = t-e and e is a unit of S, i.e., if and 
only if t is a trivial factor of s. 

Thus, if s ss t, then s | t and i | s, and they generate the same principal 
ideal, i.e., (s) = (t). However, the converse is not necessarily true. 

Clearly, sa is an equivalence relation. □ 
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A ring element s is said to be a proper divisor (or proper factor) of u 
(denoted, s || u) if s is a factor of u but not a trivial factor of u, i.e., 

s \ u A s ,6 u. 

Note that (u) ^ (s) implies that s \\ u, but not necessarily the converse. 

Let S be an integral domain with identity, i.e., it has no nonzero zero 
divisor. Then the following propositions hold: 

Proposition 6.2.1 Every pair of domain elements s and t generating the 
same principal ideal are associates. That is, (s) = (t) implies that s m t. 

PROOF. 

If (s) = (t) = (0), then s = and t = 0, and there is nothing more to 
prove. Hence assume that s^O and t 7^ 0. Since (s) = (t), we have 

s = t ■ e and t = s ■ e' . 

Thus s — s ■ e' ■ e. Since S is an integral domain and s 7^ 0, e ■ e' = 1, and 
e is a unit in S. That is, s w t. □ 

Proposition 6.2.2 Let s and u be two domain elements, such that s | u 
and u\ s. Then (s) = (u), and s and u are associates. □ 

Proposition 6.2.3 A domain element s is a proper divisor of u if s \ u 
but u \ s. Thus s || u if and only if (u) £ (s). □ 

Proposition 6.2.4 Assume that S is Noetherian. Then, every nonunit of 
S is a finite product of indecomposable elements. 

PROOF. 

Let u be a nonunit of S. If u is indecomposable, there is nothing more to 
prove. Otherwise, u admits a nontrivial factorization u — s ■ t, where both 
s and t are nonunits. Continuing this way we see that u admits a factoriza- 
tion into indecomposable elements. The finiteness of this factorization is a 
consequence of the Noetherianness of S as follows: Since s || u and t \\ u, 

(u) £ (s) and (u) £ (t). 

If u admits a factorization into infinitely many indecomposable elements, 
then so does one of s and t, and we can construct a nonstationary ascending 
chain of ideals by repeating the above process, which would contradict the 
Noetherianity of S. □ 

Proposition 6.2.5 Every nonzero prime domain element u is indecom- 
posable. 
PROOF. 

Assume that u admits a factorization u = s ■ t. Thus s | u and t | u. 
Furthermore, since u is prime, and u | s ■ t, either u \ s or u \ t (or both). 
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Without loss of generality assume that u \ s. Hence u m s, and s = u ■ e. 



u = s ■ t = u ■ et 

=> et = 1 (since and S is an integral domain.) 

=> t is a unit of S 

=> u admits only trivial factorizations 
=> u is indecomposable. □ 

Unique Factorization Domain 

We consider an integral domain S. In the following discussions, the zero 
element of S will be excluded from the consideration. 

Definition 6.2.5 (Unique Factorization Domain: UFD) Lets' be an 
integral domain with identity. We say S is a unique factorization domain 
(or briefly, a UFD) if it satisfies the following two conditions: 

1. Every nonunit of S is a finite product of indecomposable elements. 

2. The factorization, obtained as above, is unique, ignoring order and 
unit factors. □ 

More explicitly, the second condition in the above definition means the 
following: If a = pi ■ ■ -p m = qi ■ ■ ■ q m , where pi and qj are indecomposable, 
then m — n, and on renumbering the indices of g/s, we have pi w qi, i = 1, 
. . ., TO. 

Theorem 6.2.6 An integral domain S with identity is a unique factoriza- 
tion domain if and only if we have the following: 

1. Every nonunit of S is a finite product of indecomposable elements. 

2. Every indecomposable element of S is a (nonzero) prime element of 



PROOF. 

Since the first condition is the same as the first condition of the definition, 
we focus only on the second condition of the statement. 

(=>) Let u be an indecomposable element of S. Assume that u \ s ■ t, i.e., 
u ■ v = s ■ t. Let 



Thus 



S. 



s = 



Yip'*, * = n^' and v 



* 3 



k 



Section 6.2 



Unique Factorization Domain 



203 



be factorizations of s, t, and v, respectively, into indecomposable elements. 
Thus 

! (N CM' ) (ii ) 

and by the uniqueness of the factorization, we see that u is an associate of 
either a p^ or a p" . Thus u \ s or u \ t. 

(<=) Let u be a nonunit element of S which admits a factorization into 
s indecomposable elements. We shall prove our assertion by induction on 
the number s. 

• Base Case: s = 1. 

In this case u is indecomposable, and has exactly one obvious factorization. 

• Induction Case: s > 1. 

Assume that u has two factorizations into indecomposable elements one of 
which involves exactly s factors: 

s t 
i=l j=l 

Since p\ is a nonzero prime element dividing the product p\ ■ ■ ■p' t , p\ must 
divide one of the elements p\ , . . . , p' t . Let, say, p\ divide p[ . Thus p[ = pi ■ t, 
and since p[ is indecomposable, t = s must be a unit of S, and pi s=a p\. 
Thus 

s t 

pi ■ Y[pi =pi • ^n^' 

i=2 3 =2 

and since p\ ^ and S is an integral domain, we have 

s t 
i=2 j=2 

the left-hand side involving exactly (s — 1) indecomposable factors. Thus 
by our induction hypothesis, the above two factorizations differ only in the 
order of the factors and by unit factors. Thus the assertion of the theorem 
follows. □ 

Greatest Common Divisor and 
Least Common Multiplier 

Let S be a unique factorization domain, and s\, . . ., s r € S be a set of 
elements in S. Let u be a maximal common divisor and v be a common 
divisor of s\, . . ., s r . If v is a unit of S, then v \ u. Hence assume that v 
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is a nonunit. Then u must be a nonunit. Let u and v admit factorization 
into decomposable elements, as follows: 

u = H Pi , v = l[p' j . 

i 3 

Then each p'- is an associate of some indecomposable factor of each of s\, 
. . ., s r , and thus an associate of some pi. Arguing this way we see that 
u | u. Thus if u and v are two maximal common divisors of s\, . . ., s r , 
then u | v and v \ u, and u m v. Hence s\, . . ., s r have a maximal common 
divisor u, unique up to a unit factor; such a maximal common divisor u is 
said to be a GCD of s\, . . ., s r : 

u = GCD(si, . . . , s r ). 

Similarly, we can show that si, . . ., s r have a minimal common multiplier 
w, unique up to a unit factor; such a minimal common multiplier w is said 
to be a LCM of s\, . . ., s r : 

w = LCM(si, . . . ,s r ). 

It is also easy to show that 

1. u = GCD(si, . . ., s r ) w GCD(GCD(si, . . ., s r _i), s r )). 

2. w = LCM(si, . . ., s r ) w LCM(LCM(si, . . ., s r _i), s r )). 

3. GCD(si, . . ., s r ) ■ LCM(si, . . ., s r ) w si • • • s r . 

We can also define GCD and LCM somewhat differently as follows: 

Definition 6.2.6 (Greatest Common Divisor) In a unique factoriza- 
tion domain, any set of elements si, . . ., s r has a greatest common divisor 
(GCD), that is, an element u which is defined as follows: 

1. u is a common divisor of s\, . . ., s r . 

2. If v is a common divisor of s\, . . ., s r , then v divides u. □ 

Definition 6.2.7 (Least Common Multiplier) In a unique factoriza- 
tion domain, any set of elements s\, . . ., s r has a least common multiplier 
(LCM), that is, an element w which is defined as follows: 

1. w is a common multiplier of s\, . . ., s r . 



2. If v is a common multiplier of si, . . ., s r , then v is divisible by w. 
□ 
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Definition 6.2.8 (Coprimality) Let S be a unique factorization domain 
with identity. If GCD(s,t) is a unit of S, then s and t are said to be 
relatively prime (or co-prime). □ 

The following are important but straightforward properties of relatively 
prime elements: 

• If s and t are coprime and t \ s ■ u, then t | u. 

• If s and t are coprime and if s | u and t \ u, then s • t | u. 

Primitive Polynomials 

Let S be a unique factorization domain, and S[x] be the ring of uni- 
variate polynomials over S. Let f(x) £ S[x] be a univariate polynomial of 
degree r > with coefficients in 5: 

/ (x) = s r x r + s r _! x r_1 H + s , s , . . . s r e S. 

The coefficient of the highest-degree monomial of f(x), s r , is said to be the 
head coefficient of f(x), Hcocf( J). 

Lemma 6.2.7 Every nonunit of S[x] is a finite product of indecomposable 
elements. 

PROOF. 

Since S is an integral domain, so is S[x]: 

If f(x) ^ and g(x) ^ but f(x)-g(x) = 0, then Hcoef(/) ^ 0, 
Hcocf(.g) ^ and Hcocf(/) • Hcoe%) = 0. 

Furthermore, by Hilbert's basis theorem, since S is Noetherian, so is 
S[x]. Thus S[x] is a Noetherian integral domain and the lemma follows 
immediately. □ 

Definition 6.2.9 Let f{x) e S[x] be a univariate polynomial of degree 
r > with coefficients in S: 

f{x) = s r x r + s r _i x r_1 + • • • + s , so, . . . s r e S. 

If GCD(s , . . ., s r ) is a unit of S, then we say f(x) is a primitive 
polynomial, and if s r = Hcoef(/) is a unit of S, then we say f(x) is monic. 

We also call GCD(s , . . ., s r ), the content of the polynomial f(x) 
Content(/). Thus Content(/) is unique up to a unit factor. 
Any polynomial f(x) £ S[x] can thus be written as 

f{x) = s ■ g(x), 
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where s — Content(/), and g(x) is a primitive polynomial. Here g(x) is 
called the primitive part of f(x): 

f(x) = Content (/) • Primitive(/). 

Since the content of / is unique up to a unit factor, and since S is an 
integral domain, it follows that Primitive(/) is also unique up to a multiple 
factor. □ 

Lemma 6.2.8 In a unique factorization domain S, the product of two 
primitive polynomials is a primitive polynomial. Thus if f(x), g(x) G S[x], 
then 

Content(/ • g) w Content (/) • Content(g), 
Primitive(/ • g) w Primitive(/) • Primitive(g). 

PROOF. 

Assume that p(x) and q(x) are two primitive polynomials whose product 
p(x) ■ q(x) is not primitive: 

p{x) = a n x" + ■ ■ ■ + a r x r + ■ ■ ■ + a\ x + a , 
q{x) = b m x m H \-b s x s -\ \-biX + b . 

Since p(x) ■ q(x) is not primitive, there is a (nonzero) prime element p' e S 
that divides the coefficients of p(x) ■ q(x). Furthermore, since p(x) and q(x) 
are primitive, we can choose a r and b s such that 

a r — the least indexed coefficient of p(x) not divisible by p' , 
b s = the least indexed coefficient of q(x) not divisible by p' . 

But the (r + s) th coefficient of p(x) ■ q(x) is given by 

c r + s = a r ■ b s + a r +i • b s -i + • • • + a r _i • b s +i + a r -2 ■ fc s +2 + ■ ■ ■ ■ 

Since p' \ ai (0 < i < r), p' \ bj (0 < j < s) and p' \ c r+s , we conclude that 
p' | a r ■ b s . Since p' is a prime element, p' \ a r or p' \ b s . In either case, we 
derive a contradiction to our choice of a r and b s . 

Let f(x) and g{x) e S[x]. Let Primitive(/)Primitive(t/) be a primitive 
polynomial h(x). Then 

Content (/ • g) Primitive(/ • g) = Content (/) • Content(g) h(x). 

Thus 

Content (/ • g) w Content (/) • Content(g), 
Primitive(/ • g) w h(x) = Primitive(/) • Primitive^), 

as was to be shown. □ 
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6.3 Principal Ideal Domain 

Definition 6.3.1 (Principal Ideal Domain: PID) Let S be an inte- 
gral domain with identity. We say S is a principal ideal domain (or, briefly 
a PID) if every ideal of S is a principal ideal, i.e., generated by a single 
ring element. D 

Let S be a principal ideal domain with identity. Then the following 
statements hold. 

Proposition 6.3.1 Every principal ideal domain is Noetherian. 

PROOF. 

Since every ideal in a principal ideal domain is finitely generated (actually, 
generated by one ring element) , a principal ideal domain is Noetherian. □ 

Proposition 6.3.2 Every nonunit of S is a finite product of indecompos- 
able elements. 

PROOF. 

Follows from Theorem 6.2.4, since every principal ideal domain is a Noethe- 
rian integral domain. □ 

Proposition 6.3.3 Let u be an indecomposable element of S . Then (u) is 
a maximal ideal of S, in the sense that every ideal of S properly containing 
(u) is the entire ring. 
PROOF. 

Let (v) be an ideal of S such that (u) £ (v). Thus v | u and utfc v. Hence 
u = r ■ v, where r is not a unit. Since u admits only trivial factorizations 
v must be unit of S, and (v) = S. □ 

Proposition 6.3.4 Let u be a nonzero prime element of S. Then (u) is a 
maximal ideal of S. 

PROOF. 

Simply note that u is an indecomposable element of S. □ 

Proposition 6.3.5 Let u be an indecomposable element of S. Then u is 
a nonzero prime element of S. 

PROOF. 

Assume that u \ s ■ t. If u \ s, then there is nothing more to prove. Hence, 
assume that u \ s. Hence (u) $1 (it, s) = (1). Thus there exist elements a 
and b such that 

1 = a ■ u + b ■ s and t = at ■ u + b ■ st. 
Thus u | t, and u is a prime element of S. □ 

Corollary 6.3.6 Every principal ideal domain is a unique factorization 
domain. □ 
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Note that 



If S is a principal ideal domain, and if {u} is a basis for the 
principal ideal (u) = (si, . . ., s r ), then 



u = GCD(s 



and if {w} is a basis for the principal ideal (w) — (s\) n • ■ ■ fl 
(s r ), then 

w = LCM(si, . . . , s r ). 



6.4 Euclidean Domain 

Definition 6.4.1 (Euclidean Domain) Let S be a commutative ring 
(with identity). Let g be a map from S \ {0} into N, which associates 
to every nonzero ring element s a nonnegative integer g(s) as follows: 

1. (Ordering on S) 



That is, S does not contain a nonzero zero divisor, i.e., S is an integral 
domain. The map g defines an ordering on the elements of S that 
preserves multiplication. In particular, 



That is, for any two ring elements s, t <G S (s ^ 0), there is an 
expression 



V s,t e S, s ^ 0, t ^ s- 1 ^ A g(s- 1) > g(s) . 




(V s €S, s± O) [g(s) = g(l ■ s) > g(l) 



2. (Division Algorithm) 




(t = q ■ s + r) A (r = V g(r) < g(s)) . 



t = q ■ s + r 



(q = quotient, r = 



remainder) 



in which cither r = or g(r) < g(s). 



Such a ring S is said to be a Euclidean domain. □ 
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Remark 6.4.2 For our purpose, we will make further assumption that the 
map g and the division process are actually computable. □ 

Given two elements, s, s' <E S 1 , we say s < s' if g(s) < g(s'). Let / be 
an ideal in an Euclidean domain. Define fh(I) as follows: 



m(/) = 



0, if / - (0); 

min < (7 \ {0}), otherwise. 



That is, if I = (0), then fh(I) = 0; otherwise, rh(I) is a nonzero s£ / such 
that for all s' £ I either s' = or g(s') > g(s). 

Clearly, such a map m exists, since the range of g is a subset of the set 
of nonnegative integers. 

Also, if J ^ (0), then define g(I) to be g(m(I)). 

Proposition 6.4.1 If I C 5 is an ideal in a Euclidean domain S, then I 
is a principal ideal generated by fa(T). 

PROOF. 

There are two cases to consider: 

• Case 1: I = (0). 

Since / is a principal ideal generated by 0, there is nothing more to 
prove. 

• Case 2: I ^ (0). 

Let s = fh(I), and t e I be an arbitrary element of the ideal I. Since 
S has a division algorithm, and s ^ 0, we can express t as 

t = qs + r, q,r e S, 

such that r = or g(r) < g(s). 

But r = t — qs £ I, and thus by our choice of s, r = or g(r) > g(s). 
Thus we conclude that r = 0, and 



Ntel 



t = qs e (s) 

Hence J C (s) C J, and J is a principal ideal generated by fh(I). LI 
Corollary 6.4.2 Every Euclidean domain is 

• Noetherian; 

• a principal ideal domain; 

• a unique factorization domain. □ 
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Proposition 6.4.3 Let I\ and I 2 be two ideals in a Euclidean domain, 

such that h ^ (0), I 2 ^ (0), and h £ h- Then g(h) > g(I 2 ). 

PROOF. 

Let si = rh(Ii) and s 2 — rh(I 2 ). By the hypothesis of the proposition, 
,si 7^ 0, and s 2 ^ 0. Now, by definition, 



By condition 2 of Definition 6.4.1, there exist qi and r\ such that 



where n = or g(h) = g{si) > g(n). 

Furthermore, since I\ C I 2 , we see that qi si G L 2l and n = s 2 — qiS\ e 
I 2 . Thus n = or #(n) > g(s 2 ) = g(I 2 ). 

Thus combining the above observations, we see that n = or g{I\) > 
g{ r i) > g{h)- However, n = would imply that s 2 = qi si e 7i, and 
h = (s 2 ) C 7i, contrary to our hypothesis. □ 

Note the following: 

If S is a Euclidean domain, and if u = m((si, . . ., s r )), then 



Example 6.4.3 (Examples of Euclidean Domains) 

1. S = K, a field. Let g be the map 

ff : K\{0}^N 
: s^O. 

Both the conditions are trivially satisfied. Note that for any two s, 
t e S (s ^ 0) 




s 2 = qi si + n, 



u = GCD(si,... 



and if w = m((si) n - - - (~l (s r )), then 



77; = LCM(si, . . . 



t = (t ■ s _1 ) • s + 0. 



2. S = Z, the ring of integers. Let g be the map 



.9 



Z\{0}^N 
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3. S = K\x), the ring of univariate polynomials over the field K. Let g 
be the map 

g : K[x}\{0}^N 
: /^deg(/). □ 



Corollary 6.4.4 Every field is a Euclidean domain □ 

6.5 Gauss Lemma 

Since S is an integral domain, we can define its field of fractions (i.e., 
quotient field), £ as 



: seSandteS\{0}} 



with the addition, multiplication, and multiplicative inverse defined, re- 
spectively, as below: 

£i £2 si ■ t 2 + s 2 ■ h 
ti t 2 t\ ■ t 2 

£l £2 gl ' s 2 

ti t 2 ti ■ t 2 

Sl \ t x 



if si 9^0. 

h J si 

Since S[x] is a Euclidean domain, S[x] is a unique factorization domain. 

Let 

/( a; ) = M ) 6eS\{0}, and /(x) e S[x]. 

Then we associate /(x) with /(x). Conversely, if /(x) € <S[x], then we 
associate /(x) = /(x) e with /(x). 

Lemma 6.5.1 Every indecomposable element of S[x] is a nonzero prime 
element. 

PROOF. 

Assume to the contrary, i.e., for some indecomposable element p(x) £ S[x] 
there are two polynomials f(x) and g(x) <E S[x] such that 

P(x)\f(x), p(x)\g(x), but p(x) | f(x)g(x). 

There two cases to consider. 
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• Case 1: deg(p) = 0, and p is an indecomposable element of S. 

Note that p | h(x) if and only if p | Content(Zi). Thus p is a prime 
element of S, and 

p \ Content(/), p \ Content (g), but p | Content(/) Content(g), 
which leads to a contradiction. 

• Case 2: deg(p) > 0, and p{x) is a primitive indecomposable polyno- 
mial of S[x\. 

Thus p{x) is an indecomposable (thus, a prime) element of S[x\: 
Since if 

p(x) = h(x) ■ h'{x) = — — 

then bb' p(x) = h(x) ■ h'(x) and 

p(x) = Primitive(/i) • Primitivc(ft,'). 

Also note that p(x) \ h(x) if and only if p(x) \ h(x). If h'(x) ■ p(x) = 
h(x), then h'(x) ■ p(x) — h(x) and | h(x). Conversely, if p(x) \ 
h(x), then h'(x) ■ p(x) = h(x). Thus 

h'(x) , . h(x) 

and bh'(x) ■ p(x) = b'h(x). Thus 

Primitive(/i') • p(x) = Primitive(Zi) and p(x) \ h{x). 

Thus p(x) is a prime element of S[x], and 

p(x) \ f(x), p(x) \ g(x), but p(x) | f(x) ■ g(x), 

which leads to a contradiction. CJ" 

Theorem 6.5.2 (Gauss Lemma) If S is a unique factorization domain, 
then so is S[x}. □ 

6.6 Strongly Computable Euclidean 
Domains 

Recall that every Euclidean domain is Noetherian. We further assume the 
following: 
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1. The Euclidean domain S under consideration is computable, i.e., for 
all s and h£S there are effective algorithms to compute 

— s, s + u, and s ■ u. 

2. For the given Euclidean domain S with the map 

g:S\{0}^M, 

there are effective algorithms to compute g(s), for all nonzero seS 
and to compute the quotient q = q(s, u) and the remainder r = r(s, u) 
of s and u G S, s ^ 0: 

u = q ■ s + r, such that r = V g(r) < g(s). 

In order to show that a Euclidean domain S satisfying the above two 
computability conditions is in fact strongly computable, we need to demon- 
strate that S is detachable and syzygy-solvable. 

Detachability: Using Euclid's Algorithm 

Let S be a Euclidean domain with the computability properties dis- 
cussed earlier. We shall present an extended version of Euclid's algorithm, 
which, given a set of elements si, . . ., s r € S, computes s = GCD(si, . . ., 
s r ), and a set of elements m, . . ., u r such that 

s = u\ ■ s\ + ■ ■ ■ + u r ■ s r . 

Note that (s) = (si, . . ., s r ), and the detachability of S proceeds as follows: 

Let t e S, {si, . . ., s r } C S and s = GCD(si, . . ., s r ). If s \ t, 
then t £ (s) = (si, . . ., s r ), otherwise t € (s) = (si, . . ., s r ) 
and ift = v-s, then 

t = (v ■ Ui)si + ■■■ + («■ u r )s r 

where s and Mi, . . ., u r are obtained from the extended Euclid's 
algorithm. 

Next, we present a generalized extended Euclid's algorithm based on 
successive division: 
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Extended-Euclid(si, . . ., s r ) 
Input: si, . . ., s r € S. 

Output: s = GCD(si, . . ., s r ) € S and (m, . . ., u r ) G S r such that 

S = Ml • Si + • ■ • + U T ■ S r . 



if si = • • • = s r then return (1,0,..., 0; si); 

Assume that <?(si) 0, . . ., g(s r ) =fc 0, and 
g(si) <■■■< g(s r ); 

Insert the following elements into a queue Q 
in the ascending order of their g values; 
(wi,i, . . . ,wi, r ; wi) := (1,0, ... , 0; si); 

(w r ,i, ■ ■ ■ ,t»r, r ;t» r ) := (0,0, ... , l;s r ); 



while Q is nonempty loop 
if \Q\ = 1 then 

return the queue element (wi,i, . . . , wi, r ;wi); 
end{if }; 

Dequeue the following first two elements of the queue Q: 
(u>i,i, . . . ,wi, r ; wi) and (102,1, ■ • • , w 2 , r ; w 2 ); 

Let W2 — q ■ wi + r; 

Comment: This is computed by an application of the 
division algorithm; 

Enqueue (wi,i, . . . , wi) in the queue Q; 
if r 7^ then 

Enqueue (w 2 ,i, . . . ,W2, r ;w 2 ) - q ■ (11)2,1, ■ • • , w 2 , r ; w 2 ); 
end{if }; 
end{loop }; 
end{ Extended-Euclid} □ 

The correctness and termination of the algorithm follows from the fol- 
lowing easily verifiable facts. Assume that at the beginning of each iteration 
the queue Q contains the following t (0 < t < r) elements 

wT = (101,1, . . . ,w\y,w\), 



w t = (w t! i,...,w t y,w t ). 

1- 9(wi) <■■■ < g{wt)- 

2. For all j (0<j<t), 

w j,l ' s l + ' ' ' + w j,r ' S r = Wj. 
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3. (wi, . . ., w t ) = (si, . . ., s r ). 

4. If the queue Q = [uJT, . . ., W[] before the main loop and Q' = [w' lt 
. . ., w' t ,] at the end of the main loop, then 

g(wi) > g(w[) and t > t', 

with one of the inequalities being strict. 

As an immediate consequence of (4), we get the termination with a 
single tuple 

(tui,i, . . .,wiy,wi). 
Since (wi) = (si, . . ., s r ) by (3), we have 

tin = GCD(«i,... ) « r ) 

at the termination. The rest follows from the condition (2). 

Since GCD(si, . . ., s r ) = GCD(GCD(si, . . ., s r _i), s r ), we could have 
computed the GCD of a set of elements by repeated applications of Euclid's 
successive division algorithm for pairwise GCD. Note that the pairwise 
GCD algorithm computes the GCD of a pair of elements si and s 2 with 
a time complexity of 0(g ■ C), where g — min(g(si), 5(52)) and C — cost 
of a division step. The algorithm presented here derives its advantage 
from computing the pairwise CCD's in an increasing order of the g- values, 
starting with Sj of the smallest g(si), and thus has a time complexity of 

0({g + r)-C), 

where g = min(g(s 1 ), . . ., g(s r )). 

Syzygy-Solvability 

Let {si, . . ., s r } C S. In this subsection, we show that the Euclidean 
domain S is syzygy-solvable. 

s = GCD(si, . . . , s r ) = u\ ■ si H h u r ■ s r 

and s'i — Si/s for i = 1, . . ., r. 

Then the syzygy basis for (s\, . . ., s r ) is given by 

tl = ({u 2 s' 2 -\ hUrS'r), —U 2 s' i , . . . , —U r s\) 

*2 = (-Uis' 2 , (tiis'i + U 3 s' 3 H hMrs'r), -U 3 s' 2 , ■ ■ ■ , ~U r s' 2 ) 

t r = ( — Uis' r ,...,—U r -is' r , (Uis'i + • • • + U r -is' r -i)) 
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To see that (ti, . . ., t r ) is really a basis for syzygy, we may prove the two 
required conditions, as in the case of Z. 

There is another syzygy basis for (si, . . ., s r ) which has a simpler struc- 
ture. Let {si, . . ., s r } C S, 

s = GCD(si, . . . , s r ) = u\ ■ s\ H + u r ■ s r , 

and Sij — GCD(si, sj) for all 1 < i < j < r. Then the syzygy basis for 
(si, . . ., s r ) is given by the following basis: 

nj = (o,...,o, ,o,...,o, -— , o,...,o), 

position i position j 

for 1 < i < j < q. Again the arguments to prove that it is a basis for the 
module of syzygies is identical to that given in the case of Z. The proofs 
are left to the reader. 



Problems 

Problem 6.1 

(i) Show that the extended Euclidean algorithm can compute the GCD 
of two integers b\ and hi in time 9 (log |6i| + log | i>2 1 — |- 1 ) time. 

(ii) What is the time complexity of the extended Euclidean algorithm 
for computing the GCD of r integers &i, . . ., b r . 

(iii) Devise an efficient algorithm to determine if the following linear 
diophantine equation with rational coefficients has an integral solution: 

&i x\ + ■ ■ ■ + b r x r = c. 



Problem 6.2 

The complex numbers a = a + ib (a and b are integers) form the ring of 
Gaussian integers: if a = a + i b and 7 = c + i d are two Gaussian integers, 
then 

a + 7 = (a + c) + i(b + d), 
—a = —a — ib, 
a ■ 7 = (ac — bd) + i(ad + be). 

Let g(a) be defined to be the norm of a, given by a 2 + b 2 . Show that the 
ring of Gaussian integers with the above g map forms a Euclidean domain. 
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Problem 6.3 

Prove the following: 

Let S be a unique factorization domain. Then every prime element of 
S generates a prime ideal and every nonprime element of S generates a 
nonprime ideal. 

Problem 6.4 

Let S be a Euclidean domain with identity. 

(i) Show that if Si and s 2 arc two nonzero elements in S such that 
Si | S2, then s\ || s 2 if and only if g(s2) > g{si). 

(ii) Using the above proposition, prove the following: 

Let s e S be a nonunit element. Then s can be expressed as a finite 
product of indecomposable elements of S. 

Problem 6.5 

Let S be a unique factorization domain, and A(x) and B{x) G S[x] two 
univariate polynomials of respective degrees m and n, m> n. 

(i) Show that 

Content (GCD (A, B)) w CCD (Content (A), Content (B)), 
Primitive(GCD(A, J B)) w GCD(Primitive(A), Primitive (B)). 

(ii) Let m > n > k. Show that deg(GCD(A, B)) > k + 1 if and only if 
there exist polynomials T(x) and U(x) (not both zero) such that 

A(x) T{x) + B{x) U(x) =0, deg(T) < n - k - 1, 



(iii) Let m > n > k. Suppose that deg(GCD(A, B)) > k. Then for all 
T(x) and U(x) [not both zero, and deg(T) < n — k—1, dcg(J7) < m — k—1], 
the polynomial C(x) = A(x) T(x) + B{x) U(x) is either zero or of degree 
at least k. 

Problem 6.6 

Consider a Noetherian UFD that, in addition to the ring operations, 
allows constructive algorithms for 

(a) factorization and 

(b) 1-dctachability for relatively prime elements, i.e., 



deg(J7) < m — k — 1. 



if l = GCD(pi,p2) 
then I compute a\, a 2 ) 1 = aipi + a2}?2 




Show that such a UFD is a strongly computable ring. 
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Problem 6.7 

Let S be an integral domain and h, . . ., l Tl pairwise coprime ideals of 
S. Prove that there exists a natural isomorphism 



s/f]ii = ms/ii). 



As an immediate consequence of the above statement, we get the fol- 
lowing corollaries Chinese remainder theorems: 

(i) Let mi, . . ., m r be pairwise coprime integers. Then for any set a\, 
. . ., a r of integers, the following system of congruences 

x = a\ (mod mi) 
x = a 2 (mod m 2 ) 



x = a r (mod m r ) 

admits an integer solution. 

(ii) Let fi(x), . . ., f r (x) <E K[x] be pairwise coprime univariate poly- 
nomials over the field K. Then for any set a\{x), . . ., a r (x) of polynomials, 
the following system of congruences 

g(x) = ai(x) [mod/i(x)] 
g(x) = a 2 (x) [mod f 2 (x)} 

g(x) ee a r (x) [mod f r (x)\ 

admits a polynomial solution g. 

Finally, devise an algorithm to solve the congruence relations in each 
case. 

Hint: First, show that there is always a solution to the following sys- 
tem of congruences, for each i: 

x ee (mod ii) 
x ee (mod I 2 ) 



x ee 1 (mod /,) 

x ee (mod I r ). 
For each 1 < j < r (j ^ i), we can choose pj € Ii and qj e Ij such that 

Pj+Qj = L 
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Then the following a; is a desired solution to the system of congruences: 

X = qi ■ ■ -<7i-i q i+ i ■ ■ -q r 

= (1 - Pi) ■ ■ ■ (1 - Pi-l) (1 - Pi+l) ■ ■ ■ (1 - Pr) 

Now it remains to be shown that the solution to a general system of 
congruences can be obtained as a linear combination of the solutions to 
such special systems. 



Problem 6.8 

Let f(x) € lj\~r\] (p = a prime) be a square- free polynomial with the 
following factorization into irreducible polynomials: 

f(x) = fi(x) ■■■ f r {x), 

where no factor occurs more than once. 

(i) Show that given a set of r distinct elements oi, . . ., a r € Z,, there 
exists a polynomial g(x) [deg(g) < deg(/)] such that 

fi(x) | g(x) - at. 

(ii) Show that 

g(xf - g(x) = g(x) (g(x) - 1) • • • (g(x) - p + 1) = (mod f(x)), 
for some g(x) G \ Z,, deg(g) < deg(/). 



Problem 6.9 

Let f{x) G Z,[^] (p — a prime). Show the following: 

(i) Suppose that there is a polynomial g £ deg((/) < deg(/), 
such that 

g(xy-g(x) = [mod f(x)}. 

Then, for some a € Z,, GCD(/, ^(a;) — a) = a polynomial factor of f(x). 

(ii) Conclude that 

/ € Z, [^] is an irreducible polynomial 

if and only if 

g( x ) P - 9(x) = [mod /(a;)], deg(g) < deg(/), 
has no polynomial solution. 
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Problem 6.10 

(i) Using linear algebra, devise an efficient algorithm to determine a 
polynomial solution g(x) <E Z, [/"■>] (p = a prime) for the following congru- 
ence equation 

g(x) p - g(x) = [mod f(x)}, deg(<?) < deg(/), 

where f(x) G Z,[r\] is given. 

(ii) Devise an algorithm to factor a square- free polynomial f(x) € Z, [s^]. 



Problem 6.11 

Let f(x) £ Z[^], and say it factorizes as / = gh. 

Show that for all k > 0, if fk = /(mod p k ), gk = g (mod p k ) and 
hk = ft.(mod p k ) then 

fk = 9k h k (mod p k ). 

Devise an algorithm which, given / and a factorization (mod p k ) (p = 
a prime), 

fk = gk h k (mod p k ), 
can compute a factorization (mod p k+1 ) 

fk+i = gk+i h k +i (mod p k+1 ). 



Solutions to Selected Problems 

Problem 6.2 

We will show that ring of Gaussian integers satisfy all three conditions 
of the Euclidean domain. 

1. a ■ 7 = (ac — bd) + i(ad + be). 

If a, 7 ^ but a ■ 7 = 0, then ac = bd, ad = —be (i.e., abd 2 = 
—abc 2 ), and therefore d 2 = —c 2 , which is impossible as a, b, c, d e Z. 
Hence 

a, 7^0 => «7 7^ 0. 

2. #(a • 7) > g(a) and g(a ■ 7) > .9(7). 



g(a ■ 7) = (ac - bd) 2 + (ad + be) 2 

= (ac) 2 + (bd) 2 + (ad) 2 + (be) 2 > a 2 + b 2 = g(a) 
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3. Let 7 denote the complex conjugate of 7, i.e., 7 = a — ib, if 7 = a + i b. 
Let a - 7 = p + i q and 7-7 = n,n£Z. Let p = qm + n, q = qm + r 2 , 
where |n|, \r 2 \ < n/2. Then obviously 

a • 7 = (Qi + i Q2) -n + (ri + ir 2 ) = q 7 • 7 + r, 

S v ' S v ' 

— q —r 

and 

r\ + r\ < (^j (n 2 + n 2 ) < g(n) = .9(7 • 7). 

Also, 

g(r) = g{a ■ 7 - q-y ■ 7) < 3(7 • 7) 
But 3(a-7-<n-7") = 9ia-q^)g{^) and 3(77) = 3(7)5(7). Therefore 

- 97)5(7) < .9(7).9(7)- 

Since 7 7^ 0, 3(7) is a positive integer. Therefore, 3(0; — 97) < 3(7). 
Hence, we can write a = qj + r' , where r' — a — qj. 

Problem 6.4 

Let 5 be a Euclidean domain with identity. 

(i) Let si and s 2 be two elements in S such that s\ | s 2 . Then Si \\ s 2 
if and only if g(s 2 ) > ff(si). 

Proof. 

Since si | s 2 , g(si) > g(s 2 ). But (si) = (s 2 ) implies that 3(si) > 3(s 2 ) > 
ff(si), and 3(si) = ff(s 2 ). And (s 2 ) £ (si) implies that g(s 2 ) > ff(si)- 
Hence (s 2 ) £ (si) if and only if g(s 2 ) > g(s\). 

The rest follows from the fact that si || s 2 if and only if si \ s 2 and 
Si 76 s 2 , that is, if and only if (s 2 ) ^ (si). D 

(ii) Let s E S be a nonunit element in the Euclidean domain S . Then 
s can be expressed as a finite product of indecomposable elements of S. 

Proof. 

The proof is by a complete induction on g(s). 

• Case 1: s = nonunit indecomposable element. 
In this case there is nothing more to prove. 

• Case 2: s = nonunit decomposable element. 

Assume that s admits a nontrivial factorization s = t\ ■ t 2 , where neither 
t\ nor t 2 is a unit. Thus t\ \ s and t\ 76 t 2l and t\ || s. Similarly t 2 \\ s. 
Then g{t\) < g(s) and g(t 2 ) < g(s). By the induction hypothesis, t\ and 
t 2 can be expressed as finite products of indecomposable elements, and so 
can be s. □ 
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Problem 6.5 
(i) 



GCD(A,B) = GCD (Content (A) -Primitive^), 
Content(B) • Primitive^)). 

Let g = GCD(Contcnt(A), Content (E)) and 

Content (A) Content(B) 

u= , v= , 

9 9 

where u and v are relatively prime. 

GCD(A,B) = GCD^uPrimitivef/l), g v Primitivc(B)) 
= g ■ GCD^w Primitivc(A) , v Primitive(-B)) . 

We claim that 

GCD^m Primitivc(A), v Primitivc(i?)^ 
w GCD(Primitive(A), Primitive^)) 

It is obvious that 

GCD(Primitive(A), Primitive(B)) 

| GCD(uPrimitive(A), v Primitivc(B)^ . 
If they are not associates, then (3 c G S[x]) such that c is not a unit and 
c- GCD(Primitive(A), Primitive^)) 

| GCD(wPrimitivc(A), v Primitive(-B)) , 

but c cannot have degree > 0, therefore c € S which implies that c | u and 
c | v, contradicting the fact that u and v are relatively prime. Therefore, 

GCD(A, B) 

= GCD(Contcnt( J 4),Contcnt(B)) • GCD(Primitivc(A), Primitivc(B)) . 

But Content(GCD(Primitive( J 4), Primitive(S))) is a unit. Hence, 

Content(GCD(A,B)) w GCD (Content {A), Content (B)), 
Primitivc(GCD(A, J B)) w GCD(Primitive(A), Primitive (B)). 
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(ii) Let G{x) = GCD(A(x), B{x)) and A{x) = G{x) ■ U(x), B{x) = 
G{x) ■ T{x). Since deg(G(x)) > k + 1, deg(U(x)) < m - k - 1 and 
deg(T(x)) <n-k-l, 

A(x) T{x) - B{x) U{x) = G{x) U(x) T(x) - G(x) T(x) U(x) = 0. 

Conversely, let 

A(x) T{x) + B(x) U(x) = 0, deg(T) < n - k - 1, 

deg(U) < m — k — 1. 

Since S[x] is UFD, we can have unique factorization of A(x), B(x) 7 T(x), 
U(x). Let 

G(x) =GCT>(a(x), U(x)}, deg(G) < m - k - 1. 

Let = G(x) • P(x); then P(x) and C/(x) are relatively prime, and 

therefore P(x) \ B(x). But deg(P(x)) = deg{A(x)) - deg(G(x)) > k + 1. 
Since P(x) divides A{x) as well as i?(x), 

P{x) | GCD(A(x), S(x)). 

Then deg(GCD(A(x), J B(x))) > deg(P(x)) >k + l. 

(iii) Let G(x) = GCD(A(x), B(x)), 

A(x) = G(x) P(x) and B(x) = G(x) Q(x). 

Therefore 

C(x) = G(x)P(x)U(x)+G(x)Q(x)T(x) 
= G(x) ■ (P(x) U(x) + Q(x) T(x)Y 

If P{x)U(x) + Q(x)T(x) is not zero, then by the property of Euclidean 
domain, deg(G(x)) > dcg(G(x)) > k. 
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Resultants and 
Subresultants 

7.1 Introduction 

In this chapter we shall study resultant, an important and classical idea 
in constructive algebra, whose development owes considerably to such lu- 
minaries as Bezout, Cayley, Euler, Hurwitz, and Sylvester, among others. 
In recent time, resultant has continued to receive much attention both as 
the starting point for the elimination theory as well as for the compu- 
tational efficiency of various constructive algebraic algorithms these ideas 
lead to; fundamental developments in these directions are due to Hermann, 
Kronecker, Macaulay, and Noether. Some of the close relatives, e.g., dis- 
criminant and subresultant , also enjoy widespread applications. Other ap- 
plications and generalizations of these ideas occur in Sturm sequences and 
algebraic cell decomposition — the subjects of the next chapter. 
Burnside and Panton define a resultant as follows [35]: 

Being given a system of n equations, homogeneous between 
n — 1 variables, if we combine these equations in such a manner 
as to eliminate the variables, and obtain an equation R = 
containing only the coefficients of the equations, the quantity 
R is, when expressed in a rational and integral form, called the 
Resultant or Eliminant. 

Thus, a resultant is a purely algebraic condition expressed in terms of 
the coefficients of a given system of polynomials, which is satisfied if and 
only if the given system of equations has a common solution. There have 
been historically two ways to view the development of resultant: the first 
algebraic and the second geometric. 

In the first case, one starts from Hilbert's Nullstellcnsatz, which states, 
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for instance, that given a pair of univariate polynomials /i and f 2 (say over 
a field), they have no common solution exactly when there exist polynomi- 
als gi and g 2 satisfying the following: 

figi + $292 = 1. 

A quick examination would convince the reader that if such polynomials 
gi and g 2 exist then their degrees could be bounded as follows: deg(<7i) < 
deg(/ 2 ), and deg(g 2 ) < deg(/i); and that their existence can be determined 
by only examining the coefficients of /1 and f 2 . Thus, at least in principle, 
an algebraic criterion can be constructed to decide if the polynomials have a 
common zero. This is essentially Sylvester's dialytic method of elimination. 

In the second case, one examines the zeroes (in an algebraic extension) 
of the polynomials. Say the zeros of /1 are a\, a 2 , ■ ■ ., and the zeros of / 2 , 

0i, fo, Then the following is clearly a necessary and sufficient algebraic 

condition for fi and f 2 to have a common solution: 

cute ^) = c fl n/2(«i) = c h n/i(&) - 0, 

where C's are nonzero constants. Since these conditions are symmetric in 
the a's as well as /3's, one can express the above conditions in terms of the 
coefficients of fi and f 2 (which are also symmetric polynomials of a's and 
/3's, respectively). 

These discussions should make apparent that resultants are also inti- 
mately connected to the computation of GCD of two polynomials and the 
related Bezout's identity: 

/ifli + / 2 S2 = GCD(/ 1 ,/ 2 ). 

In an exploration of the extension of the extended Euclidean algorithm to 
the polynomials, we shall also encounter polynomial remainder sequences, 
the connection between pseudodivision and resultant-like structures (in 
particular, subresultants), and various efficient computational techniques. 

This chapter is organized as follows: Sections 7.2 and 7.3 introduce 
resultant in a rather general setting and discuss some of their properties. 
Section 7.4 discusses discriminants — a concept useful in testing whether a 
polynomial in a unique factorization domain has repeated factors. Next, 
in Section 7.5, we consider a generalization of the division operation from 
Euclidean domains to commutative rings by the "pseudodivision" and how 
it leads to an "extended Euclidean algorithm" for polynomials. We also 
describe determinant polynomials, a useful tool in proving results about 
pseudodivision. Section 7.6 touches upon the subject of polynomial re- 
mainder sequences, which is then related to the concept of subresultants 
and subresultant chains. The last two sections explore these connections 
in greater details. 
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7.2 Resultants 

Let S be a commutative ring with identity. Let A(x) and B(x) G S[x] be 
univariate polynomials of respective positive degrees m and n with coeffi- 
cients in the ring S. 

A(x) = a m x m + a m -ix m ~ l + ■ ■ ■ + a , dcg(A) = to, and 
B(x) = b n x n + bn-xx* 1 - 1 + ■ ■ ■ + b , deg(B) = n. 



Definition 7.2.1 (Sylvester Matrix) The Sylvester matrix of A{x) and 
B{x) e S[x], denoted Sylvester(A, B), is the following (to + n) x (to + n) 
matrix over S: 

Sylvester(A, B) 



In particular, the first n rows of the Sylvester matrix correspond to the poly- 
nomials x n ~ 1 A(x), x n ~ 2 A(x), . . ., A(x), and the last to rows, to x m ~ 1 B(x), 
x m - 2 B(x), B(x). □ 



Definition 7.2.2 (Resultant) The resultant of A(x) and B(x), denoted 
Resultant(A, B), is the determinant of the Sylvester matrix Sylvester^, B), 
and thus is an element of S. □ 

Since Sylvester(i?, A) can be obtained by to • n row transpositions, we 
see that 




n staggered 
rows of 
coefficients 
of A 



in staggered 
rows of 
coefficients 
of B 



Resultant^, .4) = det(Sylvester(i3, A)) 



= (-l) mn det(Sylvester(A,B)) 
= (-l) mn Rcsultant(A,B). 
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Properties of Resultant 

Lemma 7.2.1 Let S be a commutative ring with identity, and A(x) and 
B{x) <E S[x] be univariate polynomials of respective positive degrees m and 
n with coefficients in the ring S. Then there exist polynomials T(x) and 
U(x) <E S[x] such that 

A(x) ■ T(x) + B(x) ■ U(x) = Rcsultant(,4, B), 

where deg(T) < deg(-B) = n and deg(U) < deg(A) = m. 

PROOF. 

Consider the Sylvester matrix of A and B: 

Sylvester(A, B) 

a m 1m— l • • • tto 

1m 1 m -l ■ • ■ CJ() 



0"m &m-l ' ' ' 

bn b n -l ■ ■ ■ ■ ■ ■ bo 

bn b n -\ ■ ■ ■ ■ ■ ■ bo 

bn b n -l bo 



Let us create a new matrix M' from M by following elementary matrix 
operations: 

1. First, multiply the i th column by x m+n ~ l and add to the last column 
of M. 

2. All but the last column of M' are same as those of M. 
By definition, 

Resultant (A,B) = det(M) = det(M'). 
We observe that matrix M' is as follows: 



Sylvester(^4, B) 



bn bn-l 

bn bn-l 



fflo 

&m drn—1 

bo 

bn-l 



Ei=0 aiX 
bo U= biX m+i - 2 



> n rows 



> m rows 
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dm &m — l ' ' ' 

dm dm — 1 

bn b n -l 

bn bn-l 



do 

do 

dm dm—1 

bo 



x n - 1 A(x) 
x n ' 2 A(x) 

A(x) 



> n rows 



> m rows 



6 x m - 2 B(x) 

-1 B(x) 

Note that since the last column of the matrix M' is simply 

x^A^x), A(x), x^B^x), B(x)} T , 

we can compute the dct(M') explicitly by expanding the determinant with 
respect to its last column. We then have the following: 

Resultant (A,B) = det(M') 

= x n - l A(x) ■ M' hm+n + ■■■ + A(x) ■ M^ m+n 

+ x m - l B(x) ■ M; +lim+ „ + • • • + B(x) ■ M' m+ntm+n 

= A{x) (Mi^x™- 1 + ■■■ + M; ;ro+ „) 

+ B(x) (m'^^x™- 1 + ■■■ + M' m+nW ) 
= A(x)-T{x) + B(x)-U(x). 

Note that the coefficients of T(x) and U(x) are cofactors of the last column 
of M', and hence of M, and are ring elements in S. 
Clearly, 

deg(T) < n - 1< dcg(B) and deg(f7) < m - 1< deg(A). □ 



Lemma 7.2.2 Let A(x) and B(x) be univariate polynomials of respective 
positive degrees m and n, over an integral domain S. Then 

Rcsultant(A, B) = 

if and only if there exist nonzero polynomials T(x) and U(x) over S such 
that 

A(x)-T{x)+B{x) ■ U(x) = 0, 
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where deg(T) < deg(B) = n, and deg(U) < deg(A) = m. 

PROOF. 

(=>) By Lemma 7.2.1, Resultant(y4, B) = implies that there exist uni- 
variate polynomials T(x), U(x) € S[X] such that 

A(x) ■ T(x) + B(x) ■ U(x) = Rcsultant(A, B) = 0, 

where deg(T) < dcg(B) = n and deg(U) < deg(A) — m. 
Thus we may assume that 

T(x) = t n _ x x n - x +t n _ 2 x n - 2 + --- + t , 
U(x) = u m -\x m ~ x + u m - 2 x m ~ 2 H \-u , 

where U, Ui G S. 

We claim that not all ti's and Ut's are zero. Since 



A(x) ■ T(x) + B(x) ■ U(x) = 0, 



we have 



tn — 1 ' ®>m ~t~ ^m— 1 ' b n 



t ■ a + u ■ b 



i.e. 



-1 • b n -l 


f «m-2 


• b n 


= 


h 






= 
= 


t n -i 




' " 




to 

U m -1 














_ _ 





(7.1) 



Sylvester(A, B) 1 



But since det(Sylvester(A, B)) = Resultant (A, B) = by assumption, and 
since we are working over an integral domain, the system of equations 7.1 
has a nontrivial solution. That is, not all ti's and Uj's are zero, as claimed. 

(<=) Conversely, assume the existence of T(x) and U(x) as in the state- 
ment of the lemma: 

T(x) = t p x p + t p - 1 x p ~ 1 +...+to, 

U(x) = U q X Q + Uq-lX^ 1 H hliO, 



where 



t p ^ 0, u q ^ 0, p < n, and q < m. 
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that 

tn-l 



We may then write T{x) and U(x) as below, while tacitly assuming 
• • = tp+i =0, t p ^ 0, u m _i = • • • = u q+1 = 0, and u q ^ 



t n -l 




" " 


to 







V>m— 1 







M 




_ _ 



T{x) = tn-ix™- 1 + t n _ 2 x n - 2 + ■ ■ ■ + t , t n _l ^ 0, 
E/(x) = Um-ia;" 1 " 1 + u m - 2 x m ~ 2 H hu , u m _i 7^ 0. 

Now, expanding the expression A(x) ■ T(x) + B{x) ■ U(x) = 0, we see 
that the following linear system 



Sylvester(A, B) 1 



has a nontrivial solution. Since Sylvester(A, B) is over an integral domain 
S, we have 

det(Sylvester(A, B)) = Resultant(A, B) = 0. □ 



Lemma 7.2.3 Let S be a unique factorization domain with identity, and 
A(x) and B{x) be univariate polynomials of positive degrees with coeffi- 
cients in S. Then 

Rcsultant(A, B) = 
if and only if A{x) and B{x) have a common divisor of positive degree. 

PROOF. 

(<=) Assume that A(x) and B(x) have a common divisor C(x) of positive 
degree. Then, since S is an integral domain, 

A(x) = C(x) ■ U(x), dcg(J7) < deg(A), and 
B(x) = C(x)-T(x), deg(T) < deg(B). 



Therefore, 



A(x) ■ T(x) + B(x) ■ (-U(x)) 

= C(x) ■ T{x) ■ U(x) - C(x) ■ T(x) ■ U{x) = 0. 



Thus by Lemma 7.2.2, Resultant(.4, B) = 0. 
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(=>) Since Resultant(j4, B) = 0, we know by Lemma 7.2.2 that there exist 
two nonzero polynomials T(x) and U(x) € S[x] such that 

A(x) ■ T(x) - B(x) ■ U(x) = Rcsultant(A, B) = 0, 

where deg(T) < deg(£>) = n, deg(C7) < deg(A) = to, and 

A{x) ■ T{x) = B{x) ■ U(x). 

By the Gauss lemma, S[x] is also a unique factorization domain. Therefore, 

Primitive(A) • Primitive(T) = Primitive(_B) • Primitive(/7). 

Thus 

• ■■A m ,(x) ■ Ti(x) • ■ -Tp/(x) - Bi(s) • --B^x) ■ U^x) ■ ■ ■ U q ,{x), 

where 

A\(x) ■ ■ ■ A m '(x) is a primitive factorization of A(x), 
T\{x) ■ ■ ■ Tp'(x) is a primitive factorization of T(x), 

B\{x) ■ ■ ■ B n '(x) is a primitive factorization of B(x), 
Ui(x) ■ ■ ■ U q '(x) is a primitive factorization of U(x). 

By the uniqueness of the factorization, each A, (x) is an associate of a Bj (x) 
or an associate of a Uk{x). Since deg(Ai • • • A m /) > deg(J7i • • • U q >), there 
must exist an Ai(x) that is an associate of a Bj(x). 

Therefore, Ai{x) is a primitive polynomial of positive degree, and is a 
divisor of both A(x) and B(x). □ 



7.3 Homomorphisms and Resultants 

Let S and S 1 * be commutative rings with identities, and 

<f>: S-> S* 

be a ring homomorphism of S into S* . 

Note that <j> induces a ring homomorphism of S[x] into <S*[x], also de- 
noted by <f), as follows: 

: S[X]^S*[x] 

: a m x m + a m -\x m ~ x H ha 

' ^ 4>(a m )x m + ^{am-^x" 1 - 1 + ■■■ + <j>{a ). 
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Lemma 7.3.1 Let A(x) = YlT=<d aixl an< ^ B(x) = X)"=o ^ iX% ^ e ^ wo u ™~ 
variate polynomials over the ring S with deg(A) = m > and deg(B) = 
n > 0. If 



deg(<p(A)) = m and deg(cf>(B)) = k, (0 < k < n) 



then 



^(Resultant (A, B)) = 0Kn) ■ Resultant (<p(A), <j>(B)) . 



PROOF. 

Let 



A* = <P(A) = 0(a m )x m + 0(a m _ 1 )x m - 1 + --- + 0( ao ), and 
B* = <j>{B) = <j)(b k )x k + ^(b k ^ 1 )x k - 1 +... + 0(6o). 

Then, M, the Sylvester matrix of A(x) and B(x), is 
M = Sylvester(^, B) 

dm dm— 1 ' ' ' 0,0 

a m dm— 1 ' ' ' dO 



d m dm— 1 ' ' ' do 
&n fen-1 • • • • • • fe() 

fen fen-1 • • • • • • feo 



fen fen-1 fe() 

and M*, the Sylvester matrix of A* (x) and B*(x), is 
M* = Sylvcstcr(A*,5*) = 

(o m ) 0(d m _l) ••• 0(d O ) 

0(d m ) 0(d m -l) ••• 0(do) 



0(fo fe ) 0(6 fc _i) 

0(fe fc ) 4>{bk-i) 



(am) 0(d m -l) 

<^(6o) 



0(do) 



<M&o) 



rows 



m 

rows 



4>(b k ) 0(fo fc _x) 0(fo„) 

The matrix M* is obtained from M by the following process: 

1. First, the matrix, 4>(M), is computed by replacing the entry a, by 
4>(ai) (for all < i < m), and by replacing the entry bj by 4>(bj) (for 
all < j < n). By assumption 



<j>(b n ) = ■■■ =<f>(b k +i) = 0. 
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2. Next, from <f>(M), the first (n — k) rows and (n — k) columns are 
deleted, yielding an (in + k) x (m + k) matrix equal to M* . 

Thus 

4>(M) 

(piQui 

) 4>(a m -i) ■■■ (f>(ao) 

(t>{a m ) (j)(a m -i) ■■■ 4>(a ) 

M* 

Therefore, 

</>(Resultant(A,S)) = ^(det(Sylvester(A, B))) 

= det(0(M)) - 4>(a m ) n - k • det(0(M*)) 
= 0(a m )" _fe • 0(det(Sylvester(A*, J B*))) 
= 0(am) n_fc • (/-(Resultant (A*, B*)). 

Therefore, 

0(Resultant(A, B)) = cp(a m ) n - k • (^(Resultant (A*, £?*)). □ 
7.3.1 Evaluation Homomorphism 

Let S be a commutative ring with an identity, and (cei, . . . , a r ) £ S r be 
an r-tuplc. Define a ring homomorphism 4> ai ....,a r , called the evaluation 
homomorphism, as follows: 

4 > ai,...,a r '■ S[xi, . . . , X r ] > S 

: x\ i— > ai, 

Note that, if F(xi, . . ., x r ) € S[xi, . . ., x r ], then we shall write 
F(a l7 ...,a r ) for ^ c ,i,...,a I .(-F 1 )- 



(n-fe) 
rows 



(m + fc) 
rows 
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Definition 7.3.1 Let S be a commutative ring with an identity, and 
A{x\,...,x r ) = Aj(xi, . . .,x r -i)x l r € S[xi, . . . ,x r ], and 

i=0 
n 

B(xi,...,x r ) = y^Bjfa, . . .,x r -i)x l r g S[xi, 



i=0 



be two polynomials in S[x\, . . ., x r ] of respective positive degrees m and n 
in x r - Let 



Resultant^,. (A, B) 



B n B n -i ■ ■ ■ ■ ■ ■ Bo 

B n -Bn-l • • • • • • Bq 



> m rows 



Bn B n -\ ■ ■ ■ ■ ■ ■ Bo 

be the resultant of two multivariate polynomials A(x\, . . ., x r ) and B(x\, 
. . ., x r ), with respect to x r . □ 



Lemma 7.3.2 Let L be an algebraically closed field, and let 

Resultant^ (A, B) = C(xi, . . . , x r _\) 
be the resultant of the multivariate polynomials, 



A(xi,...,x r ) = y]Aj(xi, . . . ,x r -i)x l r € L[x\, . . . ,x r ], and 

i=0 
n 

B(xi,...,x r ) = y^Bj(xi, . . .,x r -i)x l r € L[xi, . . .,x r ]. 

i=0 

with respect to x r . Then 

1. If (ai, . . . , a r ) € L r is a common zero of A(x\, . . . ,x r ) and B{x\, 
. . ., x r ), then C(ai, . . . , a r _i) = 0. 

2. Conversely, if C(a\, . . . , a r _i) = 7 then at least one of the following 
four conditions holds: 

(a) A m (cei, . . . ,a r _i) = • • • = A (a 1 , . . . ,a r _i) = 7 or 

(b) B n (ai, . . . ,a r _i) = • • • = B (a 1 , . . . ,a r _i) = 0, or 
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(c) A m (ai, . . . , a r -i) = B n (ai, . . . , a r -i) = 0, or 

(d) for some a r G L, (a\, . . . , ot r ) is a common zero of both A(x\, 
. . ., x r ) and B(xi, . . ., x r ). 

PROOF. 

(1) Since there exist T and U in L[xi, . . ., x r _i] such that 

A ■ T + B ■ U = C, 

we have 

C(ai, . . . , a r _i) 

= A(ai, . . . , a r ) ■ T(ai, . . . , a r ) + B{a\, . . . , a r ) ■ U(ai, ...,a r ) 
- 0, 

as A(a\, . . . , a r ) = -B(ai, . . . , a r ) = 0, by assumption. 

(2) Next, assume that C(a\, . . ., a r _i) = 0, but that conditions (a), 
(b), and (c) are not satisfied. Then there are two cases to consider: 

1. A m (ai, . . ., a r -i) ^ and 

for some k (0 < k < n), Bk(ai, . . ., a r _i) ^ 
(k is assumed to be the largest such index). 

2. B n (ai, . . ., a r -i) ^ and 

for some k (0 < k < m), Ak(a\, . . ., a r _i) 7^ 
(fc is assumed to be the largest such index). 

Since Resultant (B, A) = (-l) mn Resultant(A, B) = ±C, cases (1) and 

(2) are symmetric, and without any loss of generality, we may only deal 
with the first case. 

Let <t> = (f> ai olr _ 1 be the evaluation homomorphism defined earlier. 

Thus, 

= 4>(C) 

= </>(Rcsultant(A, B)) 

= 4>{A m ) n - k ■ Resultant(0(A), 0(B)), 

and Resultant(^>(A), 4>(B)) = 0, since <f>(A m ) = A m (ai, . . ., a r _i) 7^ 0. 
If k = 0, then 

Resultant O(A), 0(B)) = <p(B Q ) m 

= B (ai, . . . , a r _i) m 7^ (by assumption). 

Hence k > and 4>{A) and <j>(B) arc of positive degree and have a 
common divisor of positive degree, say D(x r ). 
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Since L is algebraically closed, D(x r ) has at least one zero, say a r . 
Therefore, 

A(a\, . . . , a r -i, x r ) — D(x r ) ■ A(x r ), and 
B(ai, . . . ,a r -i,x r ) = D(x r )-B(x r ), 

and (a\, . . ., a r _i, a r ) is a common zero of A and B. D 



Now, consider two univariate polynomials 

A(x) = a m x m + fflin-is"' -1 H ha , deg(A) = m > 0, 

S(a;) = &„x" + 6„_ix"~ 1 + --- + fe , deg(B) = n > 0, 

of positive degrees, with formal (symbolic) coefficients a m , a m _i, . . ., ao 
and b n , b n -\, . . ., 6o> respectively. 

We consider A(x) and -B(x) to be univariate polynomials in the ring 

(Zp>, . . . K , . . . ,f])H- 

Thus, the resultant of A(x) and B(x) with respect to x is a polynomial in 
the ring 

Zp>, . . . k,- •• 
Now, if we consider the evaluation homomorphism 

^a,/3 = <t>a m ,...,a o ,0 n ,...,l3 o 

from Z[D >; . . . , Dp, K , . . . ,p] into a unique factorization domain S as fol- 
lows: 

4>a,p '■ Z[a m ,...,a ,b n ,...,b ]^>S, 



: i-» A), 



: i-» A), 
: i > , 

: n — > 1 H hi, 

v v ' 

n times 

then we can show the following: 
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Lemma 7.3.3 Let A(x) and B{x) be two univariate polynomials with for- 
mal coefficients a m , . . . ,ao, and b n , . . . , bo, respectively. Let 4>afi be any 
evaluation homomorphism for which a m ^ 0, and (3 n ^ 0. 

Then the necessary and sufficient condition that 4> a pA(x) and <j> a pB(x) 
have a common divisor of positive degree is: 

(a m , . . ., ceo, P n , . . ., f3o) satisfies the equation 

Resultant ( A, B) = 0, 
where Resultant(A, B) e Z[D>, . . . , Dj/, K , . . . 

PROOF. 

Let (a m , a , f3 n , f3 ) € S m+ " +2 be a solution to the equation 
Resultant(A, B) = 0. Let 

<!>a,p{ A ) = a mX m + am-iX™^ 1 -\ \-Oi , 

<f>aA B ) = PnX U + [In-lX™- 1 + ■ ■ ■ + fa, 

then 

Resultant (</> a j(A), 4> & ,p{B)) = <j> a ,0 (Resultant(A, B)) = 0, 

and 4> & jjA(x) and 4> & pB{x) have a common divisor of positive degree. 
Conversely, let 

^aj(^) = a mX m + a m -ix m_1 H + a a and 

4>a,p(. B ) = /3n.T" + /3„-ix"- 1 + -.. + /3o 
have a common divisor of positive degree. The assertion above implies that 
<f> & p (Resultant(A, B)) = Resultant [<j> & ^(A), <f) & ,p(B)) = 0, 

and so (a m , . . ., ao, /3 n , . . ., /3o) <= S m+n+2 is a solution to the equation 
Resultant(A, B) = 0. □ 

7.4 Repeated Factors in Polynomials and 
Discriminants 

Let U be a unique factorization domain of characteristic 0, i.e., satisfying 
the following condition: 



n = 1 + • • • + 1 7^ 0, for any positive integer n. 

n 
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Definition 7.4.1 (Differentiation Operator) The formal differentiation 
operator is a map 



a i ► 

A(x) = a m x m H ho i ^ A'(x) = ma m i m_1 H h a 1 

where a, a , . . . , a m £ U and m a m = a m H h a TO . □ 

V v ' 

m 

Let A(x),.B(x), Ai(x), . . . , A m (x) e C/[x]. Then 

1. If A(x) e f/, then A' (a;) = 0. Otherwise, deg(A'(x)) = deg(A(x))-l. 

2. D(-A(a;)) = -D(>l(x)). 

3. D(A(x) + B{x)) = D(A(x)) + D(B(x)). 

4. D(A(x) ■ B(x)) = D(A(x)) ■ B{x) + A(x) ■ D(B(x)). [Chain Rule] 

5. For alH (1 < i < m), 



Definition 7.4.2 (Square-Free Polynomial) Let A(x) £ U[x] be fac- 
torized into indecomposable factors as follows : 

A(x) = Ax{x)---A ml {x) 

A(x) is square-free (i.e., has no repeated factor of positive degree) if 



D : U[x] 



U[x] 



D (A^x) ■ ■ ■ A m (x)) 





If 



3l<i < j <m'\ Ai{x) w A,(x) A dcg(A 4 ) > 



then Aj(x) is called a repeated factor of A(x). □ 



240 



Resultants and Subresultants 



Chapter 7 



Theorem 7.4.1 A polynomial A(x) <G U[x] of degree at least 2 has a re- 
peated factor if and only if Resultant (A, A') = 0. 

PROOF. 

Let 

A(x) = A 1 {x)---A m ,{x) 

Resultant(A, A 1 ) = 

A{x) and A {x) have a common divisor of positive degree 

& (3 1 < i < m') Ai(x) I A'{x) A deg(A 4 ) > 

m' 

Ai(x) I and deg(A;) > 

3=1 

[Since A^x) \ A' t (x) as deg(^) < deg(A 4 ).] 
(3 1 < i < j < m'^j Ai(x)ttAj(x) A dcg(A i ) > 
<=> A(x) has a repeated factor. □ 

Definition 7.4.3 (Discriminant) The discriminant of a polynomial 

A(x) — a m x m H \-a , m>2 

is 



Discriminant (A) = (-l)^™" 1 )/ 2 x 



a m -i 

dm 



ai 



a m -i 



a„ 



dm-l 

m (m - l)a m -i • • • ai 

ma m (m - l)a m _i 



ao 
ai 



ai 



ao 



(m-1) 
rows 



ma m (m-l)a m _i ■■■ a x 
i.e., Resultant(A, A') = (-l)"^™" 1 )/ 2 a m Discriminant (A). □ 
Since [/ is an integral domain, we see the following: 

Corollary 7.4.2 A polynomial A{x) € U[x\ of degree at least 2 has a 
repeated factor if and only if 



Discriminant ( A) =0. D 
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Example 7.4.4 The discriminant of the quadratic polynomial 

A(x) = ax 2 + bx + c 



is 



Discriminant (A) 









1 b 


c 


= (- 


-I) 2 


1 

' 2 


2 b 











2a 


b 






1 


b 


c 


= (- 


■1) 





-b - 


-2c 









2a 


b 


= (- 


-1)( 


-b 2 + 4ac) 





Aac. 

Thus -A(x) has a repeated factor if and only if b 2 — Aac = 0. □ 

Thus, we see that discriminant allows us to reduce the problem of test- 
ing whether a polynomial has repeated factors to a simple determinant 
evaluation — a well-studied problem having efficient algorithms even when 
U is an arbitrary ring with identity. 



7.5 Determinant Polynomial 

As before, let 5 be a commutative ring. 

Definition 7.5.1 (Determinant Polynomial) Let m and n be two non- 
negative integers and M £ s mxn be an m x n matrix with elements from 
S. Define e S mxm , for i = m,...,nas the m x m square submatrix 
of M consisting of the first (m — 1) columns of M and the i th column of 
M, i.e., 

Mi,! ••• Mi i(ro _i) M hi 
M 2 ,i ••• M 2>(ro _ 1) M 2 ,, 

M mj i ■■■ M TOi ( m _i) M TOi j 
The determinant polynomial of M 

n 

DetPol(M) = ^ det(M^)x n -\ 

i—m 

Note that DetPol(M) = if n < m. Otherwise, deg(DetPol(M)) <n-m, 
the equality holds when det(M( m )) ^ 0. 
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From the definition, it is easy to see that, if m < n then 



DctPol(M) = 



M 2jl 



M 



m,l 



M 1;1 
M 2 ,i 



Mo 



(m-l) 



Er= m ^* 



71 — % 

n—i 



J 2,(m-1) 
M m ,(,„-l) EiL m M m ,iX n ~ 

Mi, (ro _i) E?=i M liia : n -' 



M 



2,(m-l) 



M, 



m,(m— 1) 



V n Af -r n ~ 



Afi,i 
M 2 i 



(m-l) 



n— 1 



' ' M 2,(m-1) E"=0 M 2,n-iZ 



n— 1 n—i n— 1 

,n— 2*£ ) 

t=0 i=0 i=0 

Let Ai(or), . . . , A m (x) be a set of polynomials in S[x] such that 
n = 1 + max deg(Ai) L 

l<i<m I J 

The matrix of Ai, ... , A m , M = Matrix^, . . ., A m ) e S mxn is defined 
by: 

Mij — coefficient of .x" - - 7 in Ai(x). 

Define the determinant polynomial of A\ , . . . , A m to be 

DctPol(Ai, . . . , Am) = DetPol(Matrix(Ai, . . . , Am)). 

Note that DetPol(Ai, . . . , A m ) = when n < m. □ 

The determinant polynomial satisfies the following properties: 

1. For any polynomial A(x) — a m x m + ■ ■ ■ + ao € S[x], 

Matrix(A) = [a m , ■ ■ ■ , »o], a 1 x (m+1) matrix; 
DetPol(A) = a m x m H h a = A(x). 



2. DetPol(. . ., A u Aj, ...) = -DetPol(. .., A , .. ., A u . . .). 
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3. For any a € S, 



DetPol(. . . , a ■ A it . . .) = a ■ DctPol(. . . , A u . . .). 



4. For any ai, . . ., aj_i, a i+1 , 



m 




= DetPol(...,A i _i,A i ,A i+ i,...). □ 



Theorem 7.5.1 Let A(x) and £>(x) 7^ be polynomials in S[x] with re- 
spective degrees k and n. Let m be an integer that is at least k and let 

5 = max(m — n + 1, 0) and S = max(fc — n + 1, 0). 

Then 

DetPol(x m -"B, ir" 1 -"- 1 ^, . . . , B, A) 

= b 5 - 5 ' ■ DctPol(x k - n B, x k - n - x B, ...,B,A) 

[Note: Ifp < 0, then DetPol^B, af^B, B, A) = DctPol(,4) = A.] 

PROOF. 

There are three cases to consider. 

• Case 1 (k<n): That is, 5' = 0: Thus 

b s n ' s ' DetPo\(x k ~ n B, B, A) = ^DetPol(A). 

- Subcase A (m < n): That is, 5 = 0: Thus 

6^DctPol(A) = DetPol(A) 

= DetPol(a; m - n B, B, A). 

- Subcase B (to > n): That is, 8 > 0: Thus 
6^DetPol(^) 



b 



n 



> 5 rows 



= DetPol 



6, 



'n 















+ 

} 1 row 



= DetPol(a; m -"B, 



B, A). 
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Case 2 (k > n = 0): Thus 



DetPol(x m -"B, ... ,B,A) 







= DetPol(x ...,B,A) 



= b s ~ s 'T>ei¥o\{x k - n B,...,B,A). 



Case 3 (fc > n > 0): Thus 
b s - 5 'DetPo\(x k - n B,...,B,A) 



fefr 5 'DctPol 



6o 



bo 



a 



> 5' rows 



}1 



row 







• b 










DetPol 






b n ■ 


■ b 


> 


> S rows 




• 


■ ■■ ■ a k ■ 


■ ■ a n ■ 


■ a 


} 1 row 



= DctPol(x m -"B, B, A). □ 
7.5.1 Pseudodivision: Revisited 

Recall the discussion on pseudodivision from Chapter 5. We had shown 
the following: Let 5 be a commutative ring. 

Theorem 7.5.2 Let A(x) and B{x) ^ be two polynomials in S[x] of 
respective degrees m and n: 

A(x) = a m x m + am-ix" 1 ^ 1 H h«o 

B{x) = b n x n + b n _ 1 x n - 1 +--- + b 

Let S = max(m — n + 1,0). Then there exist polynomials Q(x) and R(x) 
in S[x] such that 

b s n A(x) = Q(x)B(x) + R(x) and dcg(R) < deg(B). 

Ifb n is not a zero divisor in S, then Q(x) and R(x) are unique. □ 

For the given polynomials A(x) and B{x) ^ in S[x], we refer to 
the polynomials Q(x) and R(x) in S[x] the pseudoquotient and the pseu- 
doremainder of A{x) with respect to B(x) [denoted PQuotient(A, B) and 
PRcmainder(A, B)], respectively. 

Algorithms to compute the pseudoquotient and pseudoremainder may 
be found in Chapter 5. Here we shall explore some interesting relations 
between pseudodivision and determinant polynomial. 
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Theorem 7.5.3 Let A(x) and B(x) ^ be polynomials in S[x] of respec- 
tive degrees m and n andb n = Hcoef (B) . Let S = max(m — n + 1 , 0) . Then 
a pseudoremainder of A{x) and B{x) is given by: 

&£PRemaindcr(A, B) = ^DetPol^" 1 -™^ B, A) 

PROOF. 

Let 



Then 



PQuotient(A, B) 
PRemainder(A, B) 



b S n -A(x) = (q n 



Q{x) — q m — n x 
R(x). 



+ • • • + go 



Qm—n% 

Hence, we see that 

b s n DctPol(x m -"B, B, A) 
= DctPol(x m - n B, . . . , B, b s n A) 
= DctPol(x m -"S, . . . , B, b s n A - q n 
= DctPol(x m -"B, ...,B,R) 
= b 5 n R, 

since 



" + ■■■ + go) • B{x) + R(x) 
B(x) + ■ ■ ■ + q B{x) + R{x). 



l B q B) 



Matrix(x m ""B, ...,B,R) 
b n b n -\ 



bo 



b n b n -i 



bo 

TO 



> (m — n + 1) rows 



+ 

} 1 row 



where 



B{x) = b n X n +b n -lX n - L +--- + &0 

R(X) = r p x p + r-p-ix^ 1 -\ \-r , and p < n. □ 

Corollary 7.5.4 If in the above theorem b n is not a zero divisor, then 
PRemaindcr(A, B) = DetPol(.T m -™B, x^^B, . . . , B, A). □ 
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7.5.2 Homomorphism and Pseudoremainder 

Let S and S* be two commutative rings and <f>: S — > S* be a ring homomor- 
phism. Then <p induces a homomorphism of S[x] into S*[x] (also denoted 
0): 

a i ► 0(a) 
a m x m H hfl i-> </>(a m )x m H h0(a o ), 

where a, a m , . . . , ao € S*. 

For any set of polynomials Ai, . . . , A m G 

0(DetPol(^i, . . . , A ro )) = DetPol^^), . . . , <f>(A m )). 

provided max(deg(Aj)) = max(deg(0(Aj))). □ 

Theorem 7.5.5 Let A{x) and B{x) ^ be two polynomials in S[x] of 
respective degrees m and n, and b n = Hcoef(B). Let 

S = max(m — n + 1,0), 
k = deg(0(A)) < m, 
n — deg(0(B)), and 
5' — max(fc — n + 1, 0). 

Then 

0(b n ) s 0(PRemainder(A,5)) = 0(b n ) 25 - 5 ' PRemainder(0(A), 0(B)). 

PROOF. 

4>(b n ) 5 0(PRemainder(A,S)) 

= 0(b s n PRcm&inder(A, B)) 

= cp(b 6 n DetPol(x m -"B, B, A)) 

= cf>(b n ) 6 DetPol(x m - n <f>(B), 0(B), 0(A)) 

= <f>(b n ) 2S - s ' DetPol(a; fe -" 0(B), 0(B), 0(A)) 

= 0(b n ) 26 ~ S ' PRemainder(0(A), 0(B)). □ 

Corollary 7.5.6 If in the above theorem 0(b n ) is not a zero divisor of S* , 
then 

0(PRemainder(A, J B)) = 0(b n ) s - s '¥Rcm&mdcT:(0(A),0(B)). □ 

Theorem 7.5.7 Let A(x) and B(x) ^ be polynomials in S[x] of re- 
spective degrees m and n, and b n — Hcoef(_B). Let deg(aA) = k and 
deg(bB) — n, for some a, b £ S. Let 

S = max(m — n + 1,0) and 6' = max(fc — n + 1, 0). 
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Then 

b s n a b 5+s ' PRcmainder(A, B) = b 2 n 6 - 6 ' b 5 PRemainder(aA, bB). 

PROOF. 

b 5 n a b s + 5 ' PRemainder(A, B) 

= b 5 n a b s+8 ' VciPo\{x m - n B, ...,B,A) 

= b s n b s+s ' DctPol(x m -"B, B,aA) 

= b 2 r f- s ' b s+s ' DetPol(x fc -"B, B, aA) 

= bl 5 - 5 ' b 5 DctPol(x fc -" bB, ...,bB, aA) 

= b 2 n s - s 'b s PRemainder(aA, bB). □ 

Corollary 7.5.8 In the above theorem: 

1. If neither b n nor b is a zero divisor, then 

a^'PRemainder^S) = 6^ <5 'PRemainder(a^, bB). 

2. If S is an integral domain and a^O, then 

5 — 5' and 

PRemainder(aA, bB) = a6 <5 PRemainder( J 4, B). □ 

7.6 Polynomial Remainder Sequences 

Let S be an integral domain. 

Definition 7.6.1 (Similar Polynomials) Two polynomials A(x) and 
B(x) in S[x] are similar, denoted 

A(x) - B(x), 

if there exist a, b <G S such that aA(x) = bB(x). We say a and b are 
coefficients of similarity of A(x) and B(x). 

Note that if a and b are units of S, then A{x) and B(x) are associates, 
A(x) m B(x). □ 

Now we can introduce the concept of a polynomial remainder sequence 
(or, briefly, PRS) as follows: 

Definition 7.6.2 (Polynomial Remainder Sequence: PRS) Given 
S an integral domain, and Fi(x), F 2 (x) € S[x], with deg(Fi) > deg(F 2 ), the 
sequence F\, F 2 , . . ., Fk of nonzero polynomials is a polynomial remainder 
sequence (or, briefly, PRS) for F\ and F 2 if we have the following: 
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1. For all i = 3, . . ., k, 

Fi ~ PRemaindcr(F_ 2 , F_i) 7^ 0. 

2. The sequence terminates with PRcmaindcr^^i, F k ) = 0. 

The following two polynomial remainder sequences are of considerable 
interest: 

• Euclidean Polynomial Remainder Sequence, EPRS: 

The polynomial remainder sequence given by the following, 

Fi = PRemainder(Fi_2, F;_i) ^ 0, i = 3,...k, and 
PRemainder(Ffc_i, F k ) — 0, 

is said to be a Euclidean polynomial remainder sequence. 

• Primitive Polynomial Remainder Sequence, PPRS: 

The polynomial remainder sequence given by the following, 

Fi = Primitive(PRemainder(Fj_2, F^)) ¥= 0, i = 3,...k, and 
PRcmainder(F fe _ 1 , F k ) = 0, 

is said to be a primitive polynomial remainder sequence. □ 

From the definition, we see that there must exist nonzero e*, fi G S and 
Qi-i(x) r~j PQuoticnt(Fj_ 2 , -Fi-i) such that 

e l F l - 2 = Qi-i-Fj-i + fiFi, and 

deg(Fi) < deg(fi_i), for i = 3, . . . , k, 

i.e., fiFi = eiF^ - Qi-iFi-i, for all i. 

Also observe that, since pseudodivision is unique, the PRS(Fi,F 2 ) is 
unique up to similarity. Furthermore, 

GCD(Fi, F 2 ) ~ GCD(F 2 ,F 3 ) GCY>{F k -\,F k ) ~ F k , 

so that the PRS essentially computes the GCD of Fi and F 2 up to similarity. 

In defining PPRS, we have reduced the pseudoremaindcr to its prim- 
itive part at each stage of the PRS computation in order to try to limit 
the growth of polynomial coefficients. However, this incurs a (sometimes) 
prohibitively high additional cost of computing the contents of each Fj. 

Definition 7.6.3 (PRS Based on a Sequence) Let S and (thus) S[x] 
be UFD's, and F\(x), F2(x) <G S[x] be two nonzero univariate polynomials, 
with deg(Fi) > deg(F 2 ). Let F u F 2 , . . ., F k be a PRS such that 

(3i- Fi — PRemainder(F i _ 2 , F_i), i = 3, . . . , k, 
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where 0i <G S, and l | Content(PRemainder(Fj_2, then (Fi, . . ., 

Ffc) is a PRS "based on the sequence" (3 = (1, 1, /33, . . ., 0k)- 

Conversely, given a sequence (3 = (1, 1, 03, . . ., (with elements in 
5 1 ), if it is possible to define a PRS bases on (3 as follows: 

PRemamder(F 4 _ 2 ,^-i) , n . „ , , 
F, = ^ 0, i = 3, . . . fc, and 

Pi 

PRemainder(F fc _i, F k ) = 0, 

then we call (3 a well-defined sequence. Note that not all sequences are 
well-defined, and thus it is not possible to obtain a polynomial remainder 
sequence based on an arbitrary sequence. IZI 

In particular, the primitive polynomial remainder sequence 

Fi = Primitive(PRemainder(Fi„2, Fi-i)) 7^ 0, i = 3, . . . k, and 
PRcmainder(F fc _ 1 , F k ) — 0, 

is based on 



= ( 1, 1, Content(PRemaindcr(Fi,F 2 )). 

Content(PRemainder(Ffe_ 2 , -Ffe-i)) )• 



Ai 


= o, 


A 2 = 


h 


= 1, 


6 2 = 


"01 


= 1, 


lf>2 = 



Definition 7.6.4 (Subresultant Polynomial Remainder Sequence: 
SPRS) Let S be a UFD, and Fi(x), F 2 {x) <s S[x] be two nonzero uni- 
variate polynomials, with deg(Fi) > deg(F 2 ). Let Fx, F 2 , . . ., F k be a 
sequence recursively defined with the following initial conditions: 

deg(F 1 )-dcg(F 2 ) + l 
Hcoef(F 2 ) 
(& 2 ) A2 - 1 

and 

1= 2 = l and /3 3 = (-l) A2 
and the following recurrences: 
• For i = 3, . . ., k, 

PRemainder(Fi_2, fi-i) 
4 " 0i 
A, = deg(Ji_i) - deg(fi) + 1 
bi = Hcocf(Fi) 
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• For i = 3, . . ., k — 1, 

A+i = (-l^'tyi-i)*'- 1 

• PRcmainder(i ;l fe_i, Ffc) = 0. 

The sequence of polynomials (iq, F 2 , . . ., F^) is called a subresultant 
polynomial remainder sequence (or briefly, SPRS). □ 

The above definition is somewhat incomplete, since it is not immediately 
seen that ip's are in the domain S or, equivalcntly, that the sequence (3 is 
well-defined. Subsequently, we shall study the well-definedness of f3, define 
the notion of subresultant, and show various relations between the SPRS 
and the subresultant chain. 

The sequence SPRS occupies a special position in computational alge- 
bra, since it allows computation of polynomial remainder sequences without 
excessive growth in the size of the coefficients, or unduly high inefficiency. 

The coefficients of the polynomials involved in EPRS are usually very 
large. In the computation of PPRS, on the other hand, we have to compute 
the contents of the polynomials (using the extended Euclidean algorithm) , 
which makes the algorithms highly inefficient. We will see that the subre- 
sultant PRS seeks a middle ground between these two extreme cases. 

7.7 Subresultants 

We now define the notion of subresultants and then pursue a detailed mo- 
tivation for this definition. 

Definition 7.7.1 (Subresultant) Let S be a commutative ring with iden- 
tity and let A(x), B(x) e S[x] be two univariate polynomials with respec- 
tive positive degrees m and n: 

A(x) = a m x m + a m -\x m ~ x H h a , deg(A) = m > 0, 

B(x) = b n x n + b n _ l x n - 1 + ... + b , deg(B)=n>0, 

and let 

A = min(m,n) and /x = max(m, n) — 1. 

For all i in the range (0 < i < A) , the i th subresultant of A and B is 
defined as follows: 

1. The th subresultant is simply the resultant of the polynomials A and 
B. Thus 
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SubRes (A, B) = Resultant(A, B) 



m ao 



bo 



a m ■ ■ ■ ai ao xA(x) 
a m ■ ■ ■ ai A(x) 



x^Bix) 



b m ■■■ 61 b xB(x) 
b m • • • 61 B(x) 



> n rows 



> m rows 



2. For all i, (0 < i < A), the i th subresultant is: 
SubResi(^,B) 



ai a Q 



l A(x) 



<2i+i at 
a m ■ ■ ■ ctj+i 



a;A(a;) 



61 60 



l B(x) 



h + i bi 
bi+i 



xB(x) 
B(x) 



> (n — i) rows 



(m — i) rows 



The matrix in the above definition is obtained from the previous one 
by removing the top i rows that include the coefficients of A and the 
top i rows that include the coefficients of B. The first i columns now 
contain only zeroes, and they are removed. Finally, the i columns 
preceding the last column are also removed. Thus, in total, 2i rows 
and 2i columns are removed to yield a square matrix. IZI 

Using elementary column transforms we also see that 
SubResi(,4,B) 







a m ■ ■ ■ Oj+i Y?j=o x3a i 



b n 



bo 







6m ••• b i+ i bi Y^]Jo xJ+lb 3 
b m ■■■ bi+i Y^j=o x3 bj 



> (n — i) rows 



► (m- 



(m — i) rows 
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= DetPol^ 71 - 4 - 1 ^, ...,xA,A, x m - l - x B, ...,xB,B 



That is, if 



Mi 



a 



a m ■ ■ ■ cio 



b n ■■■ b 



(n-i) 
rows 

(to — i) 
rows 



m+n—i 



i.e., the matrix obtained from the Sylvester matrix of A and B, by deleting 



1. the first i rows corresponding to A (the upper half), 



2. the first i rows corresponding to B (the lower half), and 



3. the first i columns, then 



Subtlest, B) = DetPol (Mf) 

= det (A^ m+ "- 2i) ) x' + --- + dct (M^ m+ "- 4) ) . 



Thus dcg(SubRcSi) < i, with equality, if M, 



(m+n— 2i) 



is nonsingular. The 



nominal head coefficient of the SubReSi, M^ m+n 2t \ will be referred to 
the i th principal subresultant coefficient (of A and B): 

PSCi(A,B) = NHcocf(SubRcs 4 (A,5)) = det (M^ m+ " _2<) ) . 



as 



Let S be a commutative ring with identity and let A(x), B(x) € S[x] be 
two univariate polynomials with respective positive degrees to and n and 
let 



A = min(m, n) and \i — max(m, n) — 1, 
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as before. We extend the definition of the i th subresultant of A and B, for 
alii (0 < i < fi) as follows: 



Case 1: (0 < i < A). 



SubReSi(^,B) 



ai ao 



bi bo 
b n 

b n 



di+l 0-i 

ai+i 



bi+i bi 
■ ■ ■ b l+1 



xA(x) 
A(x) 

i m " i - 1 B(x) 



xB(x) 
B(x) 



(n — i) rows 



(m — i) rows 



Case 2: (i = A). 

Since we are assuming that A < \i, then \m — n\ — 1 > 0, so we have 
either m>n + lorn>ra+l, with A = n or A = to, respectively. 
The two cases are symmetrical. If A = n, then 



SubRes A (^,B) 

bn 

det 



l B(x) 



b n xB(x) 
B(x) 



(m — n) rows 



Hence 



SubRes A (A,B) = fo™-™- 1 • B(x) 

= Hcocf(B)" 1 -™- 1 • B. 

In the other case, if A = m, then 

SubRes A (A,5) = Hcocf^)"-™- 1 • A. 

Alternatively, we may write 

SubRcs A (A,5) = HcocftC) 1 ™-" 1 - 1 • C, 

where 

(A, if deg(B) > deg(A) + 1; 

C = < B, if deg(A) > deg(B) + 1; 

[ undefined, otherwise. 

• Case 3: (A < i < /j,). 

SubResi(A,B) = 0. □ 

Note that in all cases where SubRes^ is defined, 
deg(SubReSi) < i. 
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Depending on whether the inequality in the above relation is strict, we 
classify the SubReSi as defective or regular, respectively. More formally, 
we have the following: 

Definition 7.7.2 (Defective and Regular Subresultants) Asubresul- 
tant Si, (0 < i < /i) is said to be defective of degree r if 

r = deg(5' J ) < i; 

otherwise, Si is said to be regular. □ 

Sometimes it will be useful to use the following alternative definition of 
the subresultants in terms of the determinant polynomials. 

Proposition 7.7.1 Let S be a commutative ring with identity and let 
A(x), B(x) e S[x] be two univariate polynomials with respective positive 
degrees m and n, and let 

A = min(m, n) and = max(m, n) — 1, 

as before. Then the i th subresultant of A and B, for all i (0 < i < [i), is 
given by: 

• Case 1: (0 < i < A). 

SubRcs,(A, B) = DetPol (x^'^A, . . . , xA, A, x^^B, ...,xB,B). 

• Case 2: (i = X). 
SubResA (A, B) 



( DetPol (x n ~ m ~ 1 A 1 xA, A) 
= Hcocf(A)"- m - 1 • A, if deg(B) > deg(A) + 1; 



= < 



DetPol (a; 1 "-"- 1 ^,..., xB,B) 

= Hcocf(B) m -"- 1 • A, if deg(A) > deg(B) + 1. 

• Case 3: (A < i < yu). 

SubRes,(A,S) = 0. □ 

Lemma 7.7.2 Let S be a commutative ring with identity, and A(x) and 
B{x) be univariate polynomials of respective positive degrees m and n with 
coefficients in the ring S, and let 



A = min(m, n) and ^ = max(m, n) — 1, 
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as before. Then for all i, (0 < i < jj,) 



SubRe Si (A,B) = (— l)^ m ~'^™~^SubResi(B, A). 



PROOF. 

There are three cases to consider: 



1. For all < i < A 



Matrix (. 



x 



,m—i—l 



B, . . . , xB, B, x 



can be obtained from the matrix 

Matrix {x n - ;i - x A, xA, A, x^^A, ...,xB,B) 
by (m — i)(n — i) row transpositions. 

2. For i = A then (m — n)(n — n) = and 

SubReSi(A,B) = SubRes^B, A). 

3. Finally, for all A < i < fj,, 

SubResi(A, B) = SubRes^B, A) = 0. □ 

Lemma 7.7.3 Let S be a commutative ring with identity, and A(x), B(x) 
univariate polynomials of respective positive degrees m and n with coeffi- 
cients in the ring S; m > n > 0. Let 



b 5 n SubRes A _i(A, B) = b s n DetPol(A, x m ~ n B, x m ~ n - x B, . . . , xB, B) 



A = min(m, n) = n and 8 = m — n + 1. 



Then 



= DetPol(x m -"B, . . . , xB, B, b s n 

= (-1) 5 b s n PRemaindcr(A, B). 



A) 



Specifically, if S is an integral domain, then 



P Remainder^, B) = (-1) 



SubRes„_i(A,B). □ 



256 



Resultants and Subresultants 



Chapter 7 



7.7.1 Subresultants and Common Divisors 

Lemma 7.7.4 Let S be a commutative ring with identity, and A(x) and 

B(x) e S[x] be univariate polynomials of respective positive degrees m and 

n with coefficients in the ring S. Then there exist polynomials Ti(x) and 
Ui(x) <G S[x] such that for all < i < max(m, n) — 1 

A(x) ■ Ti(x) + B(x) ■ Ui(x) = SubRcs 4 (A, B), 



where 



deg(Ti) < deg(£?) — i = n — i and deg(Ui) < deg(j4) — i = m — i. 



PROOF. 

Clearly, the lemma holds trivially for all min(m,n) < i < max(m,n) — 1. 
Hence, we deal with the case: < i < A = min(m, n). 

Let us expand the following matrix (the i th Sylvester matrix) about the 
last column: 



Pi = 



l A(x) 



a i+ i at 



1,4(1) 
A(x) 



bi b () 



L B(x) 



bm ■ ■ ■ bi+i bi 
b m ■ ■ ■ b i+ i 



xB(x) 
B{x) 



> (n — i) rows 



(rn — 1) rows 



Thus, 



SubRes t (A,B) 



.n-i-l 

m — i — 1 ■ 



A(x) ■ Pi ,m+n — 2i ' Pn — i,m-\-n — 2i 



+ x m 1 B(x) ■ 

Pn — i J rl,m-\-n — 2i ~\~ ' * ' + B(x) 
= A(x) (^Pl,m+n-2iX n ~ l ~ 1 H + Pn-i,m+n-2i^ 

+ B(x) (P n -i + l,m+n-2iX m * 1 + ■ ■ ■ + P r n + n-2i ,m+n-2i ^ 

= A{x)-Ti{x) + B(x)-Ui{x); 



the coefficients of Ti(x) and Ui(x) are the cofactors of the last column of 
Pi and (thus) ring elements in S: 

deg(Ti) < dcg(£>) — i = n — i and deg(Z7j) < deg(A) — i = m — i. □ 
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Lemma 7.7.5 Let A(x) and B(x) be univariate polynomials of respective 
positive degrees m and n, over an integral domain S. Further assume that 
there exist polynomials Ti(x) and Ui{x) (not all zero) over S such that for 
all < i < max(m, n) — 1 

A(x) ■ T^x) + B(x) ■ U,(x) = 0, 

with 

deg(Ti) < n — i and deg(Ui)<m — i. 

Then 

SubResi(A,B) = 0. 

PROOF. 

The proof is a simple generalization of the corresponding lemma regarding 
resultants. 

Again, the lemma holds trivially for all min(m, n) < i < max(m, n) — 1. 
Hence, we deal with the case: < i < A = min(m, n). 

Without loss of generality, we may assume the existence of polynomials 
Ti(x) (deg(Tj) = n - i - 1) and U t (x)(deg(U l ) < m - i - 1) 
satisfying the assertion in the statement of the lemma: 

Ti( X ) = tn-i^X™-*- 1 +t n -i- 2 X n - i - 2 + --- + t , 
Ui{x) = lt m _i_i X m - i_1 + 1/ ro _i_ 2 S m_i - 2 + • ■ • + ti , 

not all tj's and ttj's zero. 

Now, expanding the equation 

A{x)-T l {x)+B(x)-U l {x) = Q, 

i.e., 

(a m x rn + . . . + oo) (t n - l - l x n - t - 1 + ■ ■ ■ + t ) 
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and equating the powers of equal degree, we get the following system of 
(m + n — i) linear equations in (m + n — 2i) variables: 

tn—i—1 ' Q>m ~\~ ' b n 

tn — i— 1 ' ^m — 1 tn — i — 2 ' &m ~\~ *U>m — i—l ' b n — 1 U m — i — 2 ' b n 

U+i • ao -\ h to ■ a i+ i + u i+ i ■ bo -\ h wo ■ h+i 

U ■ ao H + to ■ ai + Ui ■ bo -\ + uo • h 

U-i ■ ao + ■ ■ ■ + to ■ ai-i + Ui-i ■ bo + ■ ■ ■ + uo ■ 

ti ■ ao + U) ■ ai + ui ■ bo + uo • bi 
to ■ ao + uo ■ bo 

Next, multiplying the last i equations by x % , . . ., x and 1, in that 

order, and adding them together, we get a new system of (m + n—2i) linear 
equations in (m + n — 2i) variables (over S[x]): 

tn—i—l ' U"m ~t~ U m — {— i • b n 
tn-i-1 ■ drn-1 + t n -i-2 ■ 0-m + Um-i-1 ' + "m-i-2 ' &ri = 



t i+ i ■ a H h to • a i+1 + u i+ i ■ b H h Uo ■ b l+ i = 

i— 1 i 

Ux l • ao + • • • + t\x ■ x^dj + to • J]] x-'aj 
j=o j=o 

i~l i 

+ UiX 1 ■ bo + ■ ■ ■ + u\x ■ x-'bj + uq ■ x^bj = 0. 

i=o j=o 



= 

= 
= 



(7.2) 



But since the above system of equations has a nontrivial solution, we 
immediately conclude that the corresponding matrix has a determinant 
equal to zero, i.e., 

dct(SubRcs,(A,S)) =0. □ 

Lemma 7.7.6 Let A(x) and B(x), as before, be univariate polynomials of 
respective positive degrees m and n, over an integral domain S . Then, for 
all < i < max(m, n) — 1, the i th principal subresultant coefficient of A 
and B vanishes, i.e., 

PSd(A,B) = 

if and only if there exist polynomials Ti(x), Ui(x) and Ci(x) (not all zero) 
over S such that 



A(x)-T i (x)+B(x)-U i (x) = C i (x), 



(7.3) 
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where 

deg(Ti) <n — i, deg(£/j) < m — i, and deg(Ci) < i. 

PROOF. 

As before, the lemma holds trivially for all 

min(m, n) < i < max(m, n) — 1. 
(<=) Note that we can write Ti(x), Ui(x) and Ci(x) symbolically as 
Ti(x) = t n _ i _ 1 x n - i - 1 +t n _ i _ 2 x n - i - 2 + ---+t Q , 

Ui{x) = Um-i-iX™- 1 - 1 +U m - i - 2 X m - i - 2 + ---+U0, 

Ci{x) = Ci-i x 1 ^ 1 + Ci-2 x l ~ 2 H |-c , 

where by assumption not all tj's, Uj's and Cj's are zero. 

Now expanding equation (7.3), we get the following system of (m+n — i) 
linear equations in (m + n — i) variables: 

tn — i — l ' a m ~f~ ' b n — 

tn-i-1 ■ 0-m-l + tn-i-2 ■ a m + U m -i-l • &n-l + «m-i-2 ' &n = 

U+l ■ do H + to • + lt»+l • &0 H h MO • = 

U ■ ao H + to • Oi + it, • bo H + u ■ bi = (7.4) 

U-i ■ ao + ■ ■ ■ + to ■ Oj-i + Mj_i • 60 + • • • + Mo • — Cj-i = 

ti ■ ao + to • 01 + Mi • 60 + Mo • 61 — ci = 
to ■ ao + mo • bo — co = 0. 

Now consider the matrix M associated with the above set of linear 
equations: 

M T 

am am—i 

a m a m —\ 

am a m —\ 

bn b n -l 

bn bn-l 

bn b n -l 



ao 



bo 



ao 



Oi_l 



bo 



bi-i 
-1 

-1 



a 



-1 



{n-i) 
rows 



(m — i) 
rows 
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Since the system of equations (7.4) has a nontrivial solution, we sec that 
det(M T ) = 0. But since the (m + n — 2i) x (m + n—2i) principal submatrix 
of M is same as M- m+n ~ 2l \ where 



Mi 



ao 



a m ■ ■ ■ ao 
b n ■ ■ ■ b Q 



b„ 



b 



(n-i) 
rows 

(in — i) 
rows 



m+n- 



we have 

det(M T ) = (-l)Met (ikff n+n ~ 2i) ) = (-1)* PSd{A,B) = 0. 

Here, we have used the fact that: 

SubRcs,(A,B) = DetPol (Mj) 

= det (Mf m+ "- 2l) ) x l + ■ ■ ■ + dct (Mf m+M) ) . 



(=>) In the forward direction, we note that, if PSQ(A, B) = 0, then 

dct(M T ) = 0, 

and that the system of linear equations (7.4) has a nontrivial solution, i.e., 
condition (7.3) of the lemma holds. □ 

Lemma 7.7.7 Let S be a unique factorization domain with identity, and 
A(x) and B(x) be univariate polynomials of positive degrees m and n, re- 
spectively, with coefficients in S. Then, for all < i < min(m,n): 

A(x)-T i (x) + B(x)-U i {x) = 0, 

where deg(Ti) < n — i and deg(J7 i ) < m — i, if and only if A(x) and B(x) 
have a common divisor of degree > i. 

PROOF. 

Let D(x) be a common divisor of A(x) and B(x) and of highest degree 
among all such. Then A(x) and B(x) can be expressed as follows: 

A(x) = U'(x) D{x) and B(x) = T'(x) D(x), 

where, by assumption, U'(x) and T'(x) do not have a nonconstant common 
divisor. Also, note that deg(£7') = m — dcg(D) and deg(T') = n — dcg(D). 



Section 7.7 



SUBRESULTANTS 



261 



(<=) If we assume that deg(D) > i, then choose T{ = T' and t/j = —V. 
Thus 

A(a;) • T'(ar) - B(x) ■ U'(x) = 0, 
where deg(T') < n — i and deg(— [/') < m — i. 
(=>) In the other direction, since 

A(x) ■ Ti(x) + B(x) ■ Ui(x) = 0, 

we also have 

U'(x)-Ti(x)+T'{x)-Ui(x) = or U'(x) ■ T^x) = -T'{x) ■ Ui(x). 

Now, since U'(x) and T'(x) do not have a nonconstant common divisor, 
every divisor of U'(x) must be an associate of a divisor of Ui(x), i.e., 

deg(f') < deg(Ui) <m-i. 

In other words, 

deg(U') = m - deg(.D) < m - i => deg(D) > i. □ 



Lemma 7.7.8 Let S be a unique factorization domain with identity, and 
A(x) and B(x) be univariate polynomials of positive degrees m and n, re- 
spectively, with coefficients in S. Then, for all < i < min(m, n), the 
following three statements are equivalent: 

1. A(x) and B(x) have a common divisor of degree > i; 

2. (V j < ij SubReSj(A, B) = ; 

VSCj{A,B) - . 



3. 



(Vj<») 



PROOF. 

[(1) =* (2)] 

Since A and B have a common divisor of degree > i, (i.e., A and B have a 
common divisor of degree > j, for all j < i), we have, for all j < i, 

A(x) ■ Tj(x) + B(x) ■ Uj(x) = 0, 

where deg(Tj) < n — j and deg(Uj) < m — i 

=> (V j < ij SubReSj(A,B) = . (Lemma 7.7.5) 



[(2) ^ (3)] 

This holds trivially. 
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[(3) =►(!)]_ ^ 

The proof is by induction on all j < i. 

• Base Case : Clearly, PSC (A, B) = implies that Resultant(A, B) = 
and that A and B have a common divisor of degree > 

• Induction Case : Assume that the inductive hypothesis holds for 
j — 1, and we show the case for j > 0: 

PSCj(A,B) = 

and A and B have a common zero of degree > j — 1 
=* (3 deg(Q) < j) [A(x) ■ Tj(x) + B(x) ■ Uj(x) = Cj(x) 

de g( T j ) < n - .h deg(L^ ) <m-j 

(But since A and B are both divisible by a polynomial of degree > j, 
so is the polynomial C 3 ;thus, implying that Cj(x) — 0.) 

=> A(x) ■ Tj(x) + B(x) ■ Uj(x) = 

de g( T j) < n - J> dc g(^j) <m-j 
A and i? have a common divisor of degree > j. □ 

Corollary 7.7.9 Let S be a unique factorization domain with identity, 
and A(x) and B(x) be univariate polynomials of positive degrees m and n, 
respectively, with coefficients in S. Then, for all < i < min(m,n) 7 the 
following three statements are equivalent: 

1. A(x) and B(x) have a common divisor of degree = i; 

2. (V j < SubReSj(A, B) = A SubRe Sl (A, B) ± 0; 

PSCj(j4,£)=o1 A PSQ(A,5) ^ 0. □ 



3. 



(Vj<i) 



7.8 Homomorphisms and Subresultants 

Let S and S* be commutative rings with identities, and <f>: S — > S* be a ring 
homomorphism of S into S 1 *. Note that <f> induces a ring homomorphism 
of S[x] into also denoted by <j>, as follows: 

: -» 5* [a:] 

: a m x m + a m ^ix m ~ l H h a 

' ^ ^(a m )a; m + <Ham-i)z m_1 + ■■■ + cf>(a ). 
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Lemma 7.8.1 Let S be a commutative ring with identity, and A(x) and 
B{x) be univariate polynomials of respective positive degrees m and n with 
coefficients in the ring S, as before. 

A(x) = a m x m + a m -i x m_1 + • • • + ao, and 
B(x) = b n x n + b n _ 1 x n - 1 --- + b , 

where 

deg(A) — m > and deg(B) = n > 0. 

If 

deg((f>(A)) = m and deg(<p(B)) = k, (0<k<n), 
then for all < i < max(m, k) — 1 

0(SubRes 4 (A,5)) = <f){a m ) n - k SnhRes l {(j){A),(t){B)). 

PROOF. 

Let 

^ = max(m, n) — 1, A = min(m, n) 
\i! — max(m, k) — 1, A' = min(m, k). 

Clearly A' < A. 

• Case A: For i (0 < i < A'), and thus i < A. 

<j>(SubReSi(A, B)) 

= DctPoljV 1 - 4 - 1 ^), • • ■ , xcj){A), cf>(A), 

x"^- 1 ^), ...,xcf>(B), 0(B)) 

= <j)(a m ) n - k DctPol^- 4 -V(^), • • • , x<S>{A), 4>{A), 

x m -*-^(B), ...,x<t>(B), 0(B)) 
= <t>(a m ) n - k SubResj(0(A),0(S)). 

• Case B: For i = A' there are two cases to consider: 

— (Subcase I) A' = k, and thus A' < A, and i < A. 
0(SubRcs t (,4,B)) 

= DctPolf .T"- fc -V(^), • • ■ , x<j>{A), (f>(A), 
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x m ~ k ~ . . • , x<j>(B), <j>(B) 



= 4>{a m ) n ~ k DetPol^a; m - fe -V(S), . . . , x<j>{B), <f>(B) 

= <j>(a m ) n - k SubRe Si (ct>(A),<f>(B)). 
Note that degO(A)) > deg(^(S)) + 1. 

(Subcase II) A' = to, and so A' = A and deg(A) < deg(£>) + 1. 
<f>(SubResi{A,B)) 

= DctPol^"- m -V(^), ■ • • , x<f>(A), <t>(A^j 



= 4>{a m ) n - k DctPol^ fe - m -V(-4), • ■ • , x<t>(A), <j>{A) 
= <f>{a m ) n - k SuhRc Sl {^A),^B)). 
Case C: For all i (A' < i < (jf). 
</>(SubRes 4 (A, B)) 

DetPol^x™- 4 - 1 ^^), ■ ■ ■ , xcp(A), <j>(A), 

x™- 1 </>(#), ■ ■ ■ , x<j)(B), <p(B)^j , if i < A 
DetPoHV"-™- V(-B), . . . , x<j)(B), </>(B)), if i = n = A 

, 0(0), if A < i < /i. 

= 

- 0(a m )"- fc SubRc Sl (0(A),0(B)). □ 

Corollary 7.8.2 Let S be a commutative ring with identity, and A(x) and 
B(x) be univariate polynomials of respective positive degrees m and n with 
coefficients in the ring S, as before. 

A(x) = a m x m + a r „_i x m_1 + • • • + ao, and 
B(x) = b n x n + b n _ 1 x n - 1 +--- + b . 

Let 

deg(A) = to > 0, 

deg(B) = n > 0, 
deg(<£(A)) = I, (0 < I < m) and 
deg(4>(B)) = k, (0 < k < n); 

t/ien /or all < i < max(Z, fc) — 1 



= < 



Section 7.9 



SUBRESULTANT CHAIN 



265 



1. if I = m and k = n, then 



0(SubRe Sl (A,B)) = SubRcs, (0(A), <j>{B)); 



2. if I < m and k — n, then 



4>{SubRes t (A, B)) 



m 



■SubRe Si ((j)(A),<j>(B)); 



3. ifl = m and k < n, then 



4>{SubReSi(A, B)) 



n 



k 



■ SubResi {(j){A), 0(B)); 



4- if I < m and k < n, then 



<f>{SubReSi(A, B)) = 0. 



PROOF. 

We will show case (2). Case (3) is symmetrical, and the other cases are 
immediate. 

SubRes 4 (A,B) = (-l) (m_i)(n_i) SubResi(B,i4) 
0(SubRes;(B,A)) = <p(b n ) m ~ l SubRe Si (0(B), 0(A)) 
SubRcs, (0(B), 0(A)) = (-l) (m - l)(n - l) SubRe Sl (0(A),HB)) 

and therefore 

0(SubRc Sl (A,B)) 

= (_i)( m -*)(n-*) 0(6„) m -' (_i)("»-0(n-0 SubRes,(0(A),0(B)) 
= <P(b n ) m - [ SubRe Si (0(A),0(£)). □ 

7.9 Subresultant Chain 

Definition 7.9.1 (Subresultant Chain and PSC Chain) Let S be a 

commutative ring with identity and let A(x), B(x) € S[x] be two univariate 
polynomials with respective positive degrees m and n2, n\ > 

A{x) = a ni x ni + a ni ^ix ni ^ 1 + h a , deg(A) = m > 0, and 

B(x) = &„ 2 x" 2 +6„ 2 _ 1 x™ 2 - 1 + --- + 6 , deg(B)=n 2 >0. 



Let 




n\ — 1, if ni > n 2 , 
n 2 , otherwise. 
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The sequence of univariate polynomials in S[x] 

S n +1 = A, 
S n = B, 

S„_i = SubRes„_i(A ;J B), 

S = SubRcs (A,5) ^ 

is said to be the subresultant chain of A and B 
The sequence of ring elements 

PSC n+ i = 1, 

PSC„ = NHcoef(S„), 
PSC„_! = NHcoef(5„_i), 

PSCi = NHcoef(Si), 
PSCo = NHcoef(5 ) ^ 

is said to be the principal subresultant coefficient chain of A and B. 

By NHcoef, here, we denote the "nominal head coefficient" of a poly- 
nomial, i.e., the coefficient associated with the highest possible degree the 
polynomial may have — the so-called "nominal degree." D 

Definition 7.9.2 (Defective and Regular Subresultant Chain) A 

subresultant chain is said to be defective if any of its members is defective, 
i.e., for some (0 < i < ^i) 

r = &eg(Si) < i; 

otherwise it is regular. □ 

In order to understand the relation between subresultant chain and 
PRS's (polynomial remainder sequences), particularly the subresultant PRS, 
we need to explore the gap structure of a subresultant chain, which occurs 
when the subresultant chain is defective. This will be formally described 
by the subresultant chain theorem in the next section. However, in this 
section, we will simply state the theorem, provide simple intuitions behind 
the theorem and then go on to prove some important results about the 
relations that exist between subresultant chain and PRS's. 
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\ 



Case 1 
deg(A) > deg(.B) 



io 



Case 2 
deg(A) = dcg(B) 



Figure 7.1: Sub-resultant chains and their gap structures. 



In Figure 7.1, we display the gap structure ol a subresultant chain by 
diagrams in which each rectangle of width (i + 1) denotes a polynomial of 
degree i. In each case, the top-most rectangle denotes the polynomial A 
of degree n\ and the one below it denotes the polynomial B of degree ri2- 
Loos [134] attributes this pictorial representation to Habicht. 

We begin with the following definition of the block structures of a sub- 
resultant chain: 

Definition 7.9.3 (Blocks of a Subresultant Chain) A subresultant 
chain can be divided into blocks of (consecutive) subresultants such that if 

(S u Si-u Sj+i, Sj), n + 1 > i > j > 0, 

is a block, then, we have the following: 

1. Either j = and Si = Si-i = ■ ■ ■ = Sj+i = Sj = 0, 

(This is the last block in which each subresultant is zero; this is the 
so-called zero block. Note that, in this case, Si_\ ^ 0. Further, there 
can only be at most one such block.) 

2. Or Si 0, Sj ^ 0, Si ~ Sj and 5j_i = • • • = S j+1 = 0. 

(This is a so-called nonzero block. In this case, Sj is always regular 
and if i > j, then Si is defective.) □ 
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Thus, every defective subresultant Si of degree r corresponds to a unique 
regular subresultant Sj, j = r, both belonging to the same nonzero block, 
and are similar. As an immediate consequence of the subresultant chain 
theorem, we will see that any subresultant chain can be partitioned into a 
sequence of blocks, of which possibly the last one may be a zero block. It 
then follows that there cannot be two consecutive nonzero defective subre- 
sultants. 

We write the nonzero blocks of a subresultant chain as follows: 

So, Si, . . . , Si y 

The first subresultant in the i th nonzero block will be called the top element, 
S^(i) (possibly, defective) and the last subresultant, the bottom element, 
Sj^(j) (always, regular). The PSC, can be defined, similarly. 

Let 

d(i) = deg(S^ (i) ) 

e(i) = dcg(S 4(l _ 1) )-l = rf(z-l)-l 
S i+ i = dcg(S 4(l _i ) ) - dcg(S 4(i )) + 1 
= d{i - 1) - d(i) + 1. 

At this point, it is useful to state the subresultant chain theorem and 
recast it in terms of our notations in the context of the block structures: 

Theorem 7.9.1 (Subresultant Chain Theorem) Let S be an integral 
domain and let 

\S n +l, S n , S n -l, So) 
be a subresultant chain of S n+ \ and S n in S[x] (deg(S„ + i) > deg(S„)). 

1. For j = 1, . . ., n, if Sj + i and Sj are both regular, then 

(—Rj + i) 2 Sj-i = P Remainder (Sj+i, Sj) . 

2. For j = 1, . . ., n, if Sj + \ is regular and Sj is defective of degree r 
(r < j), then 

Sj-2 = ■ ■ ■ = S r +i = 0, 
Rcoef(S j ) j - r Sj, r>0, 
P Remainder (Sj+i, Sj) , r > 1. □ 

The intuition behind this theorem can be seen from the pictorial de- 
scriptions of the subresultants given in Figure 7.2. 

If we could write Sj + \ and Sj symbolically as polynomials of degrees 
(j + 1) and j respectively, then the k th subresultant (symbolically) would 



Sj-i - 

(Rj+iy s r = 

(-Rj+l) 3 T+2 S r -1 = 
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be given by the determinant polynomial of a matrix Mk whose top (j — k) 
rows would come from the coefficients of Sj+i and the bottom (j + 1 — k) 
rows would come from the coefficients of Sj. However, in order to obtain 
the k th subresultant (numerically) of the polynomials Sj+i and Sj, we have 
to eliminate the last (j — r) rows corresponding to Sj+i from the Mk (in 
the upper half) and force the entries corresponding to (j — r) higher-order 
coefficients of Sj to vanish in Mk (in the lower half). 

1. If r < k < j, then j — r exceeds j — k and the matrix Mk would have 
O's on the main diagonal, thus making its determinant polynomial 
equal 0: 

Sj-i = Sj-2 = ■ ■ ■ = S r+ i = 0. 

2. If k = r, then j — r equals j — k and the main diagonal of Mk would 
have nonzero head coefficients of Sj+i and Sj and its determinant 
polynomial would be a polynomial similar to Sj: 

5V ^ S j . 

3. If k = r — 1, then evaluating the determinant polynomial of Mk, we 
see that 

SV-i - DetPol(Sj + i, x j+1 ~ r Sj, . . . , Sj) 
<~ P Remainder (Sj + \, Sj). 

However, one caveat with the above line of reasoning is that it uses the 
false premise 

4>(SubRes k (A,B)) = SubRes fe (4>(A), 4>(B)). 

for an evaluation homomorphism <j>. The falsity of such a statement has 
been indicated in Corollary 7.8.2. A more careful and rather technical proof 
for the subresultant chain theorem is postponed. 

Corollary 7.9.2 Let S be an integral domain and let 

5b, Si, . . . , Si ^> 

be a sequence of nonzero blocks of a subresultant chain of S n +i and S n in 
S[x] (deg(S n+ i)>deg(S n )). 
Then 

(Rt).(i-i)) 5 ' +1 2 -%(») = Hcoef (Sft(i)) S ' +1 2 
(- R Hi-i)) St+1 s 1t(i+i) = P Remainder (^(i.!),^)) . 
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j - k 



j-k + 1 



j — r terms of Sj = 
Case 1 




j _ r 



j-r + l 



j — r terms of Sj = 
Case 2 




j-r+l 



j-r + 2 



j — r terms of Sj = 
Case 3 

Figure 7.2: Intuitive arguments for the subrcsultant chain theorem. 
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PROOF. 

Simply observe that if we let 

Sj+i = >%(i-i) and Sj = SY(»)> 

then 

S r = 5*4(1) and S r -i = S^( i+1 ), 

and 

Rj+i = and Rj = 0, 

R r = R^(i) and R r -i = 0. 

Also note that 

j-r = d(i - 1) - d(i) - 1 = 6 i+1 - 2. □ 
Hence, we see that 

■%(») ~ and 
SfrCi+i) ~ PRemainder(5' 4(t _ 1) ,S' t ( l) ) 

=> 

~ PRcmainder(5'jj. (i _ 1) ,S' 4 ( l) ) . 
The corollary below follows: 

Corollary 7.9.3 Let S be an integral domain, and let F\(x), F 2 (x) € S[x] 
(deg(Fi) >deg(F 2 )). 

Now, consider their polynomial remainder sequence: F\, F 2 , Fk 
and their subresultant chain, with the following sequence of nonzero blocks: 

So, Si, . . . , Si y 

Then the elements of the polynomial remainder sequence are similar to 
the regular subresultants, in their respective order, i.e., 

1. k = I + 1. 

& <%(i) ~ F i+1 . □ 

In fact, a much stronger result can be shown. Recall the definition of a 
subresultant polynomial remainder sequence of two univariate polynomials 
Fi and F 2 (deg(F) > deg(F 2 )) over a UFD, S: 



Ai 


= o, 


A 2 


= dcg(F 1 )-dcg(F 2 ) + l 


h 


= 1, 


b 2 


= Hcoef(F 2 ) 
= (6 2 ) A2 " 1 


"01 


= 1, 


02 



and 

/3i = /?2 = 1 and /? 3 = (-l) A2 

Furthermore, 



272 



Resultants and Subresultants 



Chapter 7 



• For i = 3, 



k 



PRemaindcr(i ;l i _2, Fj-i) 



A 4 



deg(Ji_i) - deg(fi) + 1 
Hcoef(Fi) 




• For i = 3, 



fc- 1, 



(-l) Ai (^-i) 



Ai-l 



Theorem 7.9.4 Let 5 tea C/.FD, and let F x {x) , F 2 {x) e 5[x] w^dcg(Fi) > 
deg(F 2 ). 



A^ow;, consider their subresultant polynomial remainder sequence: F\, 
F 2 , Fk and their subresultant chain, with the following sequence of 
nonzero blocks: 



Then 

1. F i+1 = Sft(i), i = 0, . . ., fe — 1. 

2. Vi+i = » = 0, .. ., fc- 1. 

PROOF. 

First, as a consequence of Corollary 7.9.3, we see that 



The rest of the proof is by induction. 
Claim 1 : 

(1) Assume that both (1) and (2) hold for all j = 0, . . ., i, (i > 2), wc 
shall prove (1) for i + 1. 




A, = Si 



i = 0, . . . , k. 



A+2 F i+2 

= PRcmainder (F i7 F i+ i) 
= PRcmainder (S^^i) , 
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= (lT^) A ' 

= (-i) A - +1 (^) Al 2 (^) Ai+1 s t(i+1) 

= (-i)^ (^) Al+1 ~ 1 ^ (^) Al 2 

= (-l) A -+i (^) Al+1 ^ 1 6i -% + i) 
Since we are working over a UFD, we can clear p i+2 from both sides to 

get 

Fi+2 = S^j+i). 

Claim 2: 

(2) Assume that (1) holds all j = 0, . . ., i and (2) holds for all j = 0, . .., 
i — 1, (i > 1), we shall prove (2) for i. 
Note that 

^i+i = ipi I — J 

Hcoef(F i+ i) A< + 1_1 

Hcoef(^( i) ) A '+ 1 - 1 

But since, 

(-%(*- i)) A * +1 ■%(») = Hcocf (S , t(j) ) A ' +1 2 S* t(i ), 
equating the coefficients, we have 

(-Rjj.(i-i)) Al+1 R Ui) = Hcoef (S , t(i) ) A * +1 

Hence 

as we are working over a UFD. 

In order to complete the proof, we need to take care of the following 
base cases: 

1. i = 0: 

Fi = S n+ i = Sq(o) and tpi = 1 = R n +\ = R^oy 
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2. i = l: 



By claim 1: 



3. i = 2: 



F2 — S n — S^(i) . 

1p2 = %!)• 



P Remainder^, F 2 ) 
3 = R 

P Remainder ( F\, F 2 ) 

PRemainder(S' 1 y(o), 

PRemainder(S'^(o), 



(-^(0)) A2 



%2)- 



The rest follows by induction, using the claims (1) and (2) proven ear- 
lier. □ 



7.10 Subresultant Chain Theorem 

Here, we shall provide a rigorous proof for the subresultant chain theorem. 
The proof begins with Habicht's theorem, which considers subresultant 
chains of two univariate polynomials of degrees (n+ 1) and n, respectively, 
and both with symbolic coefficients. 

The rest of the proof hinges on a generalization of Habicht's theorem, 
obtained by applying evaluation homomorphisms. This generalization di- 
rectly leads to a proof of the subresultant chain theorem. 

7.10.1 Habicht's Theorem 

Consider two univariate polynomials 

A(x) = a m x m + a m _ia; m_1 H h a , deg(A) = m > 0, and 

B(x) = b n x n + b n - 1 x n ~ 1 +... + b , deg(B) = n>0, 

of positive degrees, with formal coefhcients a m , a m -i, . . ., ao and b n , b n -\, 
. . ., &0: respectively. 

We shall treat A{x) and B{x) as polynomials in x over the ring 



Z[D>, . . . K , . . . ,>/]. 
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Thus the subresultants of A(x) and B(x) arc in 

(Z[D>, . .. ,3^, k ,. .. [a;]. 
In this section, we assume that 

deg(A) = m = n + 1. 
Thus the subresultant chain of A and -B is 

^ SVi+i = A 

S n -x = SubRes„_i(A,B), 

Si = SubResi(A,B), 
So = SubRcs (A,B) ^. 

Lemma 7.10.1 Lef A(a;) and 6e fwo univariate polynomials of re- 

spective degrees n + 1 and n, urci/i formal coefficients a n+ i,a n ,. . .,ao and 
b n , b n -i, bo, respectively. Then 

1. SubRcs„_i(A,5) = PRcmaindcr(A, B) 

2. Fori = 0, n- 2, 

& 2(n-»-i) SubResi(A,B) = SubRes i (B,PRemainder(A,B)). 

PROOF. 

First, note that 

bl SubRes„_i(A,B) 

= bl DetPol(A, xB, B) 

= bl(-l) 2 DctPoKxB, B, A) 

= bl PRcmaindcr(A, B) 

Since there arc no nonzero zero divisors in Z[D>, . . ., aa, b n , • • ■, bo], wc 
may clear the bl from both sides to find 

SubRes„_i(A, B) = PRemainder(A, B). 

Secondly, 

bl {n ~ i] SubRc Sl (,4,B) 
= bl (n ~ l) DctPol^"- 1 - 1 ^, . . . , xA, A, x n -*B, . . . , xB, B) 
= DctPo^x™- 4 - 1 b 2 n A, xblA, b\A, x n ~ l B, ...,xB,B) 
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[But b\ A = (qix + qa) B + R, where R = P Remainder (A, B) and deg(i?) = 
n- 1.] 

= DctPolO™- 4 - 1 .??, ...,xR,R, x n -'B, . . . , xB, B) 

= DetPoKx""^, . . . , xB, B, x^^R, . . . , xR, R) 

= 1 • b\ DctPol(x™- 4 - 2 B, . . . , xB, B, x'n-^R, xR, R) 

= b 2 n SubReSi{B,R). 

Since b n ^ and not a zero divisor in Z[D>, . . ., a , 6„, . . ., b ], and 
i? = PRcmainder(A, B), we have 

6 2( ™"^ 1) SubRes J (^, S) = SubResi(S, PRemainder(A, B)). □ 

Theorem 7.10.2 (Habicht's Theorem) Let A(x) and B(x) be two uni- 
variate polynomials of respective degrees n+1 andn, with formal coefficients 
a n+ i,a n ,. . .,a a and b n , b n -i, . . ., b Q , respectively. 

Let (S n+ i, S n , . . ., So) be the subresultant chain of A and B. Let Rj 
(0 < j < n + 1) be the j th principal subresultant coefficient of A and B. 
Then for all j — 1, . . . , n 

a) Rj +1 Sj-i = PRcmainder(5j+i, Sj), and 

b) R% l) S t = SubRc Sl (5, +1 ,^), fori = 0,...,j-l. 



PROOF. 

The proof is by induction on j: 

• Base Case : (j = n). 
Rn+i = 1; therefore, 

a) S n -\ = SubRes n _i(5„ + i, S n ) (by definition) 

= PRemainder(S'„ + i , S n ) (by the previous lemma) 

b) Si = SubRcSj(5„+i, S n ) (by definition) 

= SubResj(A,B) i = 0, . . . ,n - 1. 



• Induction Case : Assume that the inductive hypotheses hold for 
n, n — 1, . . . , j + 1 and consider the case when j < n. 



b) R j+2 R j+1 bi 
= Rfi~ i] SubRcs, {S j+2 , S j+1 ) 

= SubResj(.Sj + i, PRemainder(S'j + 2, Sj + i)) (by the previous 

lemma) 

= SubRes,(<Sj + i, R] + 2 Sj) (using part a) inductively) 
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= DctPol^.T J * xSj + i, Sj + i, 

= R% l+l) DetPolf^'-^ 1 ^!, xSj+u S j+1 



= R% i+1) SubRes^+i , S,-), 
where the terms cancel from both sides to produce 

Rf+V ] s i = SubRcs, (S j+1 ,Sj), for i = 0, . . . , n - 1. 
a) In particular, for i = j — 1 we get 

Rj+iSj—i = SubReSj—i (<Sj_|-i , ) 

= PRemainder(S f :(+ i , D 

7.10.2 Evaluation Homomorphisms 

Let S* be a commutative ring with identity, and <f> an evaluation homo- 
morphism defined as: 

Z[a m , . . . , ao, b n , . . . , bo] — > S* , 
di i— > a*, for i = 0, . . . , m, 

bjt-^b*, for j = 0, ...,n, 

0^0, 
fc i — > 1 H hi. 

S v ' 

fc— times 

Lemma 7.10.3 Let ^4(.t) and -B(z) 6e two univariate polynomials of re- 
spective positive degrees n+l and n, with formal coefficients a n+ i,a n ,. . .,ao 
and b n ,b n _i,. . .,b , respectively. Let 

S n +1 , S n , S n - 1 , . . . , Si , Sq 

be the subresultant chain of A and B. Let <f) be the evaluation homomor- 
phism defined above. If <fi(Sj+i) is regular and (f>(Sj) is defective of degree 
r, then 

1. 4>(R J+l f HS.-i) = 4>(Rj+i) A m-2) = ■■■ 

= ^{R 3+1 f { - J - r - 1) 0(S r+1 )=O. 



278 



Resultants and Subresultants 



Chapter 7 



2. <P(R j+1 ) 2{j - r) cf>(S r ) 

= (Hcoef(^(S'j + i))Hcoef(0(S , j))) J T <P( s j)- 

3. 4>(R j+1 f {j - r+1) RcoemS,)y- r+2 0(S r _!) 

= (- l)^ r+2 Hcocf {<f>(S j+1 ) ) j - r Hcoef(</>(^ ) ) j " r+2 
x PRemainder(^)(6'j + i), (j>(Sj)). 

PROOF. 

Since 

deg(^(5 j+1 ))=j + l, deg(^))=r, 
and, for alH (0 < i < j), 

Rfl^Si = SubResj(S,- + i,Sj), 

we see that 

tiRi+if^HiSi) = ^(SubResiCSj+i.Sj)) 

= 0(Hcocf(5 J+1 )) J " , 'SubRes,(0(5 J+ i), 0(5,-)) 
= Hcoef(0(S' J+ i))^ r SubRes,(0(5 J+ i), 0(^0). 

But 

SubRes, (</>(S, +1 ),0(S*,)) 

'0, if r < i < j; 



Hcoef(0(5j)) J r 4>(Sj), if i = r, and deg(<£(Sj)) + 1 

< deg(<X^ + i)). 



For i = r — 1, 

Kcoei(<p(S 3 )y- r+2 SubResi 0(5,-)) 

= (-l) j_r+2 Hcocf(</>(S,-)) J '~ r+2 PRcmainder(0(S J+1 ),0(S* 3 )). 

Therefore, we have the following: 

1. For i = r + 1, . . ., j — 1, 

0(i2 J -+i) 2(, '- i) 0(Si) = 0. 

2. For i = r, 

0(i2 j+ i) 2 «- r ) 0(5 r ) = (Hcoef(0(S', +1 ))Hcoef(0(S*,)))^ r 0(5,). 
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3. For i = r — 1, 

0(i? J+ i) 2 °- r+1) Hcoei(«X^-)r r+2 0(5 r _i) 

= (-l)^ r+2 Hcoef(<X^+i)) j ~ r Hcoef(0(S' J -)) J '~ r+2 
x PRemainder(^(Sj+i),0(S , j)). □ 

Corollary 7.10.4 Let A(x) and B(x) be two univariate polynomials of re- 
spective positive degrees n+1 and n, with formal coefficients a n+ i,a n ,. . .,ao 
and b n ,b n -i,. . .,bo, respectively. Let 

Sn+li S n , S n —l, . .., Si, Sq^J 

be the subresultant chain of A and B. Let <f> be an evaluation homomor- 
phism from Zp«+j^, . . ao, b n , . . ., bo] into an integral domain S* . 
If 4>{Sj + i) is regular and <j){Sj) is defective of degree r, then 

1. <KSj-i) = 0(Sj_ 2 ) = • • • = HSr+i) = o. 

2. If j = n, then 

4>(S r ) = (Hcocf(0(5„ +1 ))Hcoef(0(5„))J 0(S n ). 

If j < n, then 

<i>(R j+ i) {j - r) <f>(S r ) = RcocmS 3 )y- r 0(5,), 
since (j)(Sj + i) is regular and (f)(Rj + \) = Hcoef((/>(Sj + i)). 

3. If j = n, then 

0(Sr-i) = (-Hcocf(0(5„+i))J PRemaindcr(0(5 Il+ i), 0(5„)). 
If j < n , then 

cj){-R j+l ) j - r+2 <j){S r -x) = PRcmainder(<£(S j+ i),0(S,-)), 
since (p(Sj + i) is regular and <j)(Rj + i) = Hcoef(0(Sj + i)). □ 

7.10.3 Subresultant Chain Theorem 

Now we are ready to prove the main theorem of this section, the subresultant 
chain theorem. 

Let S* be an integral domain, and A* (x) and B*(x) be two univariate 
polynomials in S*[x] of respective positive degrees n\ and n 2 (ni > n 2 ): 



A*(x) 
B*(x) 



+ Oq , and 
+ b* Q . 
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If m > ri2, then we set ni = n + 1 and specialize 

K = ■ ■ ■ = &; 2+1 = o. 

If m — n 2 , then we set n 2 — n and specialize 

<+i = o. 

Therefore, 

_ { ni — 1, if rix > n 2 , 
\n 2 , otherwise. 

The next theorem connects the sparsity in the head coefficients of S* with 
the gap structure of the chain. 

Theorem 7.10.5 (Subresultant Chain Theorem) Let 

o n+ i, o n , o n _i, o _ 

be a subresultant chain o/5* +1 and 5* m 5*[x]. 

1. For j = 1, . . ., n, if S* +1 and S* are both regular, then 

(-R* +1 ) 2 = PRcmaindcr (S* +1 , S*) . (7.5) 



2. For j — 1, . . ., n, if S* +1 is regular and S* is defective of degree r 
(r < j), then 

Sj_i = Sj-2 = ■ ■ ■ = S* +1 = 0, (7-6) 
(R* j+1 ) j ~ r S* = Hcocf(S**)''" r 5*, r>0, (7.7) 

(-R* j+1 ) j ~ r+2 = PRcmaindcr (S* +1 , S*) , r> 1. (7.8) 

PROOF. 

Since the first case is a simple consequence of Habicht's theorem (Corol- 
lary 7.10.4), we will focus only on the case when S* +1 is regular and S* is 
defective of degree r (r < j). Let 

A{x) = a n+ i x n+1 + ■ ■ ■ + a , and 
B(x) = b n x n + --- + &0, 

be two univariate polynomials with formal coefficients a n +i, • ■ •, ao and b n , 
. . ., bo, respectively. Let 

SVi+ij S n , S n -i, So 
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be a subresultant chain of A and B (in Z[D k +f, • • ., »o, b n , . . ., 60]) with 
the principal subresultant coefficient chain: 



1, R nj Rn-1, ■ ■ ■ , Ro 



We define two evaluation homomorphisms <pi and </> 2 , corresponding 
respectively to the two cases (1) m > n 2 (i.e., n = n\ — 1) and (2) m = n 2 
(i.e., n = n 2 ). 

• Case 1 : If n\ > n 2 (i.e., n — ri\ — 1), then 

• • • , do, b n , ■ ■ ■ , bo] — > «5* 
0^ m a*, for i = 0, . . . ,n + 1, 

&j 1 ► 0, for j = ra 2 + 1, . . . , n, 



bj ^ b*, 

1 1-> 1, Oh^O. 



for j = 0, . . . ,n 2 , 



Case 2 : If n\ = n 2 (i.e., n = n 2 ), then 



Z[a„+i, . . . ,a ,b n , . . . ,b ] — ► £* 
a«+i i-> 0, 

a, a*, for « = 0, . . . , n, 

bjt-^b*, for j = 0, . . . ,n, 

1 i-> 1, 0^0. 



The following observations are immediate consequences of Corollary 7.8.2: 
• In either case (i.e., k = 1, 2), for alH (0 < i < n), 
0fc(Si) = fe (SubRes,(A,S)) 

^(an+i^-^SubResi^i^),^^)), if k = 1, 
02(&„) Il+1 -" 1 SubRes 4 (0 2 ( J 4),0 2 (B)), if fc = 2, 
(< +1 )"-" 2 S*, iffc = l, 

(6;)"-" 1+1 5*, iffc = 2. 

In either case (i.e., k = 1, 2), for alH (0 < i < n + 1), 

1. (f>k{Si) is regular if and only if S* is regular, and 

2. (f>k{Si) is defective of degree r if and only if S* is defective of 
degree r. 
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Thus, for all j = 1, . . ., n, if S* +1 is regular and S* is defective of 
degree r, then for both k = 1,2, (j)k{Sj+i) is regular and <j>k(Sj) is 
defective of degree r. 

In either case (i.e., k = 1, 2), if S* is regular, then for alH (0 < i < n), 

MRi) = 0fc(Hcoef(S i ))=Hcoef(0 fc (S i )) 
« +1 )"~" 2 iffc = l, 

{b* n ) n - ni+1 R*, iffc = 2. 

Now, we are ready to prove the lemma. We first consider the special case 
when j = n, and then consider the general case j < n. 

• Case 1 : j = n. 

That is, S* l+1 is regular and 5** is defective of degree r < n. Thus 
S* +1 = a* n+1 x n+1 + ■ ■ ■ + do, 

s* n = b* r x r + --- + b*. 

Thus n\ = n + 1, and n 2 = r < n, and the case 1 (i.e., m > n 2 ) holds. We 
also see that 

A* = min(n + 1, r) = r. 
Hence, by the definition of subresultant chains, we get equation (7.6): 

n-1 — D n-2 — ' ' ' — J r+ i — U, 

equation (7.7): 



(K + i)"~ r = 5 r * (since R* n+1 = 1) 
= Hcocf(5; i )" +1 - r - 1 S* n 



and equation (7.8): 

(-K + iY l ~ r+2 S* r _, = (-l)»-''+ 2 (since < +1 = 1) 

= (-l)™-''+ 2 (_i)n+i-r+i PRcmain dcr (S* n+1 ,S* n ) 

(using Lemma 7.7.3) 
= PRemainder (S^ +1 , S 1 *) . 

• Case 2 : j < n. 

That is, for fc = 1, 2, (f>k(Sj + i) is regular and <fik(Sj) is defective of degree 
r < j. By Habicht's theorem (Corollary 7.10.4): 

4>k{Sj-\) = 4> k (Sj-2) = ■■■ = 4> k (S r+ i) = 0, 
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and 



« +1 ) n " 2 s*_, = --- = « +1 ) n " 2 S r * +1 - 0, if k = 1, 



D r+1 — u > 



if fc = 2. 



Sr+l — 0. 



(6;) n - ni+1 s;_i = --- = (&;) T 

In either case, we get equation (7.6): 
S*_ 1 = Sj_ 2 = ■ 

Again by Habicht's theorem (Corollary 7.10.4), we have for k = 1, 2, 
fa(R j+1 Y- r MSr) = Hcoef (MSj)) j ~ r MS 3 ). 

Thus 

\ a n+l) \ H j+l) \ a n+l) \ 



J-r 



< +1 )"-" 2 Hcoef(5;) r (a; +1 )"- 2 S*, if k = 1, 



(6: 



* N0'-r)(n-m + l) 



= ((&;)""" 1+1 Hcocf(s*;)) (6;)"-" 1+1 s*, ak = 2. 

Thus after cancellation we have equation (7.7): 

(R* j+1 ) j ~ r S; = Hcoef (5*)^ r S*. 

Lastly, by another application of Habicht's theorem (Corollary 7.10.4), wc 
have for k = 1, 2, 



4>k(-R j+ i) 



j-r+2 



i»fc(5 r _i) = PRemainder(0 fe (5j + i),^fc(S'j)) • 



Thus 



/' / \ 71 — 71 2 \ ^ 7" "I - 2 \ TL — 71 2 

1 — ( a n+l) ^j'+l J ( a n+l) "r-1 



PRemainder ^(a* +1 )™ ™ 2 

/„* \™-™2 / * ^ (n-n 2 )(j-r+2) 



*n+i; 



x PRemainder (S* +1 , S*) if k = 1, 



(- (&;r ni+1 ~ r (&;r ni+1 S?_x 

= PRemainder ((&;)"~" 1+1 S** +1 , (&;)"-" 1+1 5 
= (6* (5* ^(™-«i+l)0'->"+2) 

x PRemainder (S** +1 , S 1 *) if fe = 2. 
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Thus after cancellation we have equation (7.8): 

(-R* j+1 ) j ~ r+2 = PRcmainder S*) . □ 

Problems 

Problem 7.1 

Devise a simple algorithm to compute the maximal square-free factor 
of a polynomial A(x) over a field K of characteristic 0. 

Problem 7.2 

Consider the resultant of two polynomials A(x) and B(x) over a field 

if: 

A{x) = a m x m + a m _ 1 x m - 1 + ••• +a , 
B(x) = 6„a;" + 6„_ix ,l - 1 + --- + 6o, 

of respective degrees m > and n > 0, respectively. 

Show that the Resultant(A, B) satisfies the following three conditions: 

1. Rcsultant(A, b ) = Vg. 

2. Resultant (B, A) = (-l)" m Rcsultant(A, B). 

3. If m < n and R{x) is the remainder of B(x) with respect to A(x) 
[i.e., B(x) = Q(x) ■ A(x) + R(X), deg(R) is minimal], then 

Resultant(A, B) = a^ m Resultant(A, R). 

Show that these three properties define the resultant uniquely. 

Hint: Note that the resultant of two polynomials in K[x] can be 
uniquely computed by using Euclid's algorithm and keeping track of the 
head coefficients. This algorithm also leads to a uniqueness proof by an 
induction on min(deg(A), deg(S)). 

Problem 7.3 

Using the results from Problem 7.2, show the following: 

Let A(x) and B(x) be two polynomials over an algebraically closed field 

L: 

A(x) = a rn x m + a m -i x m_1 H ha , 

B(x) = b n x n + b n _ l x n - 1 + --- + b , 

of respective degrees m > and n > 0, respectively. Then 
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(i) Resultant (A, B) = if and only if A and B have a common zero. 

(ii) Let 

A{x) = a m (x - cei)(x - a 2 ) ■ ■ ■ (x - a m ), 
B(x) = b n (x-/3 1 )(x-lh)---(x-l3 n y, 

then 

m n 

Resultant(A, B) = a n m b™ JJ J|(a, - /?,■)• 

i=i j=i 

Problem 7.4 

Consider a monic polynomial f(x) & K[x] (K = a field) of degree n and 
with n zeros ai, «2, ■ • «« in the field if (or some algebraic extension of 
K). Let 

f(x) = (x - ai)(x - a 2 ) ■ ■ ■ (x - a n ) 

= x n +a n - 1 x n ~ 1 A ha fe x fe H h% 

Prove that the following formulas, called Vieta's formulas, hold: 

fln-i = -(ai+a 2 -\ +a n ), 

a k = (-l)"- fe ■■■ a 'n-*)> 

ii<i2<---<i n -k 

a = (-l)"ai a 2 ■ ■ ■ ct n . 

Thus the coefficients of / are symmetric functions of the zeros. Here, 
symmetric functions are those functions that remain invariant with respect 
to the permutation of its variables. 

Problem 7.5 

Consider the following ring automorphism defined over S[xi, x 2 , . . ., 
x n ] (S — an integral domain) with respect to a permutation tt £ S n : 

II : S[x 1 ,X2,...,x n \^S[x 1 ,x 2 ,...,x n ] 

: f(xi,x 2 , ■ ■ -,x n ) 1 * /(a; w -i(i),a; w -i(2), • • . , x 7r -i ( „ ) ^ . 

The polynomial is symmetric if II(/) = / for all n e S n . 

Consider the following two classes of symmetric polynomials: 

1. Elementary Symmetric Polynomials 

s k(xij X2j • ■ • , X n ^j ^ ^ X{ ± X{ 2 • • • X{ k , k — 1, 2, . . . , Tl. 

l<ii <%2 <■ ■ ■ <ik<n 
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2. Power Sum Symmetric Polynomials 

n 

Pk(xi,X2, ■ ■ ■ ,x n ) =~y]xj, k = 1,2,. 



Prove that the following formulas, called Newton's formulas, relate the 
Pfc's to Sfc's: 

Pk -Pk-isi +Pk-2S 2 H h (-l) fe_ ViSfe-i + {-l) k ks k = 0, 

for 1 < k < n, and 

Pk -Pk-lSl +Pk-2S2 H h (-l)" _1 pfc-n+lSn-l + (-l)>fc-„S„ = 0, 

for k > n. 
Problem 7.6 

Consider the following matrix, V n , called a Vandermonde matrix: 



1 1 

Xi x 2 



x n-2 x n-2 

x n ~ 1 x"^ 1 



1 

Xn 

r n-2 
r.n-1 



Show that the determinant of V n over a field K is given by 
dct(K) = Q (xj - Xi). 

l<i<j<n 

Hint: The proof is by induction on n. As det(V^) vanishes when x n 
in the last column is replaced by Xi (1 < i < n), clearly (x n — Xi) | det(V^). 
After eliminating all such factors, it can be seen that 

n-1 



det(K) = det(K-i) JJ ( Xn ~ 



i=l 



(consider the coefficients of in the above equation). 

Using the above facts show the following: Let f(x) e L[x] be a univari- 
ate polynomial of degree n over an algebraically closed field L and with n 
roots: ai, «2, • ■ ■, ot n - Then 



n 


Pi 


P2 


■ ■ Pn-1 


Pi 


P2 


P3 


Pn 


P2 


P3 


Pi 


■ ■ Pn+1 



Discriminant (/) = 

Pn-l Pn Pn+1 ' ' ' P2n-2 

where Pk = J2ti «i (k = 1, . . 2n- 2). 
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Problem 7.7 

Prove the following identities: Let A(x), B(x) and C{x) e K[x\ be 
three univariate polynomials of degrees m, n, and p, respectively. 

1. Resultant(A(x),z-?/) = (-l) m A(y). 

2. Resultant (AS, C) = Rcsultant(,4, C) Resultant (B, C). 

3. Discriminant (AB) 

= Discriminant (A) Discriminant (B) |Resultant(A, B)\ 2 . 

Problem 7.8 

Let K be a field and A(x) <E K[x] be a monic polynomial: 



A(x) 



.x -+- a m — i x ■+■ 



The following matrix C A € K r ' 
polynomial A: 



+ ai x + a . 
is called a companion matrix of the 



C = C A 









-a 



1 





-Oi 





1 












-a-m-2 







1 

-a m -i 



Prove that the characteristic polynomial of the companion matrix C A is 
X (C A ) = dct (XI m - C A ) = A{\). 
Now, if B(x) e K[x] is another monic polynomial 

B(x) = x n + 6 n _i x™- 1 + • • • + h x + bo, 
and if we consider the following m x m matrix: 

B(C A ) = C\ + 6„_! C^ 1 + ••• + &! CU + 6 / m 
then show that 

det (.B(CU)) = Resultant (A, 5). 

Problem 7.9 

Let A (x), Ai(x), . . ., A„(x) e K[x] (K is a field). 
Let y be a new indeterminate, and consider the following bivariate poly- 
nomial: 

A(x, y) = A„_i(x) y n ~ l + ■ ■ ■ + A x {x) y + A (x), 
where A is regarded as a polynomial in x with coefficients in K(y). 
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(i) Show that the GCD of A and A n (when treated as polynomials in 
K{y)[x\) is the same as the GCD of Aq, A\, . . ., A n (in k[x\). 

(ii) Construct the Sylvester matrix M y of the polynomials A and A n 
[with coefficients in K(y)] and show that det(M y ) vanishes identically if 
and only if the polynomials Ao, A\, . . ., A n have a nonconstant divisor. 

Problem 7.10 

Let A(x,y) and B(x,y) <G K[x,y] be two homogeneous polynomials of 
degrees m and n, respectively: 

A(x,y) = a m x m + a m ^ix m ^y + ha y m , and 

B(x,y) = b n x n + b n ^ 1 x n - 1 y+--- + b y n . 

Prove that every factor of a homogeneous polynomial is also homoge- 
neous. 

Prove the following statement: 

A and B have a common factor of degree > k if and only if the 
rank of the following matrix is less than m + n — 2k + 2: 

am am—1 ' ' ' 

a m a m — 



b n b„-i 

bn b n -l 

b„ 



ao 

i ■■ ■ a 



a m a m -i ■ ■ ■ ao 
bo 

bo 



(n-k + 1) 
rows 



(m - k + 1) 
rows 



b n - 



bo 



Problem 7.11 

Consider r homogeneous polynomials A\, A2, . . ., A r in K[x, y] having 
the same degree n: 

Axix^) = a hn x n + ai,„_ia:™" 1 ?/H h a lfi y n , 

A 2 {x,y) = a 2 . n x n + a 2 , n -ix n ~ 1 y H h a 2j0 2/ n , 



A r (x,y) = a rvn x n + a rvn ^ix n x y H h a r $y r 



Consider the following matrix Si consisting of r blocks of rows, where 
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each block of rows consists of I — 



ai,n «l,n-l 

Cll,n Ol,ti-l 



5, = 



1 rows of a^.'s: 
ai,o 

ai,o 



Ol, 



&r,n— 1 
Q>r.n 



ttr n— 1 



ai,o 



«r,0 



<2r,0 



Show that the r polynomials Ai, A 2 , . . ., A r have a common divisor of 
degree > k if and only if the matrix S 2n -k has rank less than (2n — fc + 1). 

Generalize this theorem to the case where Ai, A 2 , . . ., A r have different 
degrees: ni, n 2 , . . ., n r . 

Hint: Use the fact that in order for a polynomial to be a common 
divisor of A\, A2, . . ., A r , it is necessary and sufficient that it is a common 
divisor of 

Aim + A 2 u 2 + • • • + A r u r and A\V\ + A 2 v 2 + • • • + A r v r , 

where u's and v's represent 2r indeterminates. The resultant of these newly 
constructed polynomials is related to the so-called Kronecker's {/-resultant. 

Problem 7.12 

Let Ai, A 2 , . . ., A r be r polynomials in K[x,y] (K = a field). Show 
that the number of common zeroes of the Ai's satisfy the following bound: 

\Z(A 1 ,A 2 , . . .,A r )\ < 2 (max degj^)) (max deg^A,)) , 

where deg x (A) and deg„(^4) are the degrees of A with respect to x and y. 

Hint: Use the idea of [/-resultant (Problem 7.11) once again. Also, 
show that there exist polynomials A\, A 2 , . . ., A r e K[x, y] such that 

\Z{A u A 2 ,...,A r )\ = (max deg^A;)) (max deg„(Ai)) . 

Problem 7.13 

Let S be a UFD and let F\{x) and F 2 (x) £ S[x] be two univariate 
polynomials with 

deg(Fi) < deg(F 2 ). 

Let n = max(deg(Fi), deg(F 2 )) = dcg(F 2 ). Define the subresultant 
chain of F\ and F 2 as 



S n +1 — F\, 
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S n — F2, 

S n -i = F\, 

S n -2 = SubRes„_ 2 ( J Fi,F 2 ), 



So = SubRes (Fi,F 2 ) 



and the principal subresultant coefficients as 



( 



PSC„ + i 



1 



PSC„ = NHcoef(S*„), 

PSC„_! = NHcoef(S n _i), 

PSC„_ 2 = NHcoef(S„_ 2 ), 

PSCi = NHcoef(S*i), 

PSCo = NHcocf(S'o) ) 



Also modify the definition of subresultant polynomial remainder se- 
quence by redefining Aj as 

Ai = 0, 

Aj = max(deg(Fj_i) - deg(Fj) + 1,0), i = 2, 3,..., 

{pi, ipi, fa and Fi stay just as in the text). 
Prove Theorem 7.9.4 for this case: 



Note that you will need to prove the subresultant chain theorem for this 
case also. 

Problem 7.14 

Give a direct proof for the following statement (i.e., do not use the 
identities involving determinant polynomials): 

Let S be a unique factorization domain. Let A(x), B{x) £ S[x] 
be two univariate polynomials of degree m and n, respectively, 
and a, [3 e S. Then 

PRemainder(aA(x), (3B(x)) = a/3 5 PRemainder(A(x), B(x)), 
where S = max{m — n + 1, 0}. 



deg(Fi) < deg(F 2 ). 
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Problem 7.15 

Let S be a unique factorization domain, and F\, F2, . . ., ffc be a Eu- 
clidean polynomial remainder sequence (deg(fi) > dcg(F 2 )). 

(i) Let 

A, = deg(fi_i) - deg(F,) + 1, i = 2,...,k. 

Let ci, c 2 , . . ., q, . . ., Ck be the sequence of the contents of the polynomials 
Fi's. 

For all m > I prove that 

A, A, +1 ...A m _! 1 

(ii) Using (i), show that the sizes of the coefficients in an Euclidean 
polynomial remainder sequence of two polynomials over Z grow at an ex- 
ponential rate. 

Problem 7.16 

(i) Let A be an n x n matrix over the integers Z. Prove the following 
inequality, the Hadamard inequality, 

. V2 

n / n 



\det(A)\ < jl E< 



j 

»=i \j'=i 

Hint: Consider the matrix AA T . 

(ii) Using (i), show that the sizes of the coefficients in a subresultant 
polynomial remainder sequence of two polynomials over Z grow at a (small) 
polynomial rate. That is, show that if F m is the m th polynomial in the 
subresultant polynomial remainder sequence of F\ (x) and F 2 (x) € Z, then 
the sizes of the coefficients of F m are 



0^(deg(Fi)+deg(F 2 ) -2deg(F m )) log(size(Fi, F 2 )(deg(Fi) + deg(F 2 )) 
Problem 7.17 

Let S be a unique factorization domain, and F\, F2, . . ., i*fc be a Eu- 
clidean polynomial remainder sequence (deg(i*\) > deg(F 2 )). Let 

A, = deg(fi_i) - deg(F,) + 1, i = 2,...,k, 

and 

Ci = Content(Fi), i = 1, . . . , k. 

(i) A sequence — (j3\ — 1, #2 = 1, (h, ■ ■ ■, Pk) is well-defined if and 
only if 

(fo-ipfJi 1 Pi) I c u i = 3,...,k. 



292 



Resultants and Subresultants 



Chapter 7 



(ii) Let F[ = F 1 , FJ,=F 2 , 

_ PRemamde r (Fl_ 2 ,FU) . _ _ , 
f i — , i — a, ... ,k. 

Pi 

Show that 

Fi = (c^_ 2 Pt-i 1 A) J?, i = 3,...,k, 
where ot\ = a 2 = 1 and 

A 4 _i o fl A,_ 3 +Ai_i a (Ai_ 2 )(Ai_i) . o , 
Oj = dj-2 Oj-! Pi-4 P;_ 3 P-_2 , « = 3, ...,fc. 



Solutions to Selected Problems 

Problem 7.3 

(i) Consider the sequence of polynomials defined by repeated appli- 
cations of Euclidean divisions: 

Ro = A 

Ri = B 

i?2 = Rcmainder(i?o, R\) 

Ri = Rcmaindcr(i?i_2, Ri-i) 

Rk = Rcmaindcr(i?fe-2, Rk-i) = r o € L. 

where without loss of generality, we have assumed that deg(A) > dcg(B). 
We have assumed that Rq, Ri, . . ., Rk~i are all nonconstant polynomials 
over L. By the results of Problem 7.2, we see that 

Resultant(A, B) = Resultant (Ro, Ri) = if and only if r = 

if and only if A and B have a common nonconstant factor. 

(ii) First note that, for any cti, 

B(a.i) = b n (a, t - 0i)(ai - f3 2 ) ■ ■ ■ (a t - (3 n ), 

and hence 

ra m n 

i—1 
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Thus, it suffices to prove that 

m 

Resultant(A, B) = a™ JjB(ai). 

i=l 

The proof follows by showing that the right-hand side of the equation 
above satisfies all three properties of Problem 7.2: Properties (1) and (2) 
arc trivial. To see property (3) note that 

If R{x) is the remainder of B(x) with respect to A{x), then 
B{on) = Q{a l )A{a l ) + R(a t ) = R{on). 



Thus, 



Resultant (A, B) = a™ J]B(a t ) 

»=i 

n 

= (-ir n c n^) 



m n 

c k nn^-^)- 

»=i j=i 



Problem 7.8 

Note that the characteristic polynomial of Ca can be calculated as 
follows: 



X(C A ) 



A -1 

A -1 

A 



a a\ a 2 



A -1 

A + 

®m— 1 



Thus, by using the results of Problem 7.3 (and working over algebraic 
closure of K), we have: 



x(Ca) = Resultant x ^Aa; — 1, 



aox™- 1 + ai x rn - 2 



+ a m -ix + a m -i 



= A 1 ' 



° A 



+ a m _2 



O.m-1 + A 



m-2 



+ 
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[since Resultant ( a (x - a), F(x)) = a dcs{F) ■ F(a).} 
= X m + a ro _iA ro " 1 + • • • + aiA + a . 
= A(X). 

A somewhat direct argument can be given by noting that Ca represents 
a linear transformation cf), over K[x]/(A(x)), the residue class ring modulo 
the ideal generated by A(x): 

4, : K[x]/(A(x))^K[x]/(A(x)) 
: B(x) i ► x ■ B(x) mod A(x) 

In particular, if a is a root of A(x), then a is an eigenvalue of Ca with 
eigenvector 



since 





(1, 


a, a 2 , . . 











1 















" 1 








1 












a 





















a 2 


-a 




-012 




2 —dm 




-l 








a 




























a 2 














a 3 














-a 


— aia — 












a 






" 1 








a 2 






a 








a 3 




= a ■ 


a 2 








_ a m - 


- A(a) _ 











Now since x(Cyt) is clearly a monic polynomial of degree m with the 
same roots as A(x), we have 

X (C A ) = det (A/ m - C A ) = A(X). 

The proof for the second part proceeds by showing that the eigenvalues 
of the matrix B{Ca) are simply _B(Ai)'s, where A^'s are respectively the 
eigenvalues of Ca and hence zeroes of A: 

If Vi is an eigenvector of Ca with eigenvalue A^ , then 

B{C A )-v t = C n A ■ v t + b^Cy 1 ■ v, + ■ ■ ■ + b x C A ■ v, + b oVi 
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= \'?vi + b n -i\™ Vi-\ + biXiVi + b Vi 

(since C k A - v l = C k ^ 1 X i v l = ■ ■ ■ = X k Vi) 
= B(Xi)vi. 

Hence, Vi is an eigenvector of B(Ca) with eigenvalue B(Xi), and the deter- 
minant of B(Ca) is simply the product of the eigenvalues B(Xi)'s. 
Now, it follows that 

m 

det(B(CU)) = n S ( A «) = Resultant (^4, B). 

i=l 

Problem 7.14 

Proposition 7.10.6 Let S be a unique factorization domain. Let A(x), 
B(x) e S[x] be two univariate polynomials of degree m and n, respectively, 
and a, (3 e S. Then 

PRemainder(aA(a;), (3B(x)) = a/3 5 PRemainder(A(x), B(x)), 

where S — maxfra — n + 1, 0}. 

PROOF. 

Let 

R(x) = PRcmaindcr(A(x), B(x)) and 
R'{x) = PRemaindcr(ayl(a;), (3B(x)). 

Then 

b 5 n A{x) = Q(x) ■ B(x) + R(x) 

Therefore, 

((3b n ) s ■ aA(x) = af3 s Q(x) ■ B(x) + af3 s ■ R(x). 

Since the second term on the right-hand side has degree less than n, R'(x) — 
a[3 5 R{x) as desired. □ 

Problem 7.15 

This is a simple consequence of the preceding problem. Define a se- 
quence 71, 72, . . ., as 

71 = 1 

72 = 1 
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7i-i = 1 
li = Q 

A, A 1+1 ..-A m _, 
7m — c l 



Clearly 7,|cj, 1 < /. We want to prove the statement for m > I assuming 
that it is true up to m — 1 . 

We may write Fj = -fiGi, for 1 < i < to. Then 

F m = PRemainder(F m _ 2 ,-F 1 m-i) 

= PRcmainder(7 m _ 2 G m _2, lm-\G m -i) 
= 7m-2 7m-! 1 PRemaindcr(G m _2,G m _i). 

Thus 7 m = cf ' A '+ 1 '" A ™- 1 = j^™' 1 divides the coefficients of F m and 
thus 7 m | Content (F m ). 

The second part is a simple consequence of the above observations: 
since A; > 2 for all / > 2, 

cf \c m , m > I > 2; 
and the size of c m is of the order O (2 m size(c ; )). 

Problem 7.17 

The solution builds on Problem 7.14. 

(i) The "only-if" part is obvious as Pi divides Cj. The "if" part follows 
from part (ii) of this problem. 

(ii) We will prove that 

Fi = a, ft_ 2 pti 1 Pi Fi 

where a\ = a 2 = 1 and 

A 4 _i a fl Ai_ 3 +A,_i fl (A,_ 2 )(A i _ 1 ) 

a i =a i - 2 a i _ 1 ft_ 4 Pi_ 3 p\_ 2 

We use induction on i. For i = 1, the relation is true by definition itself. 
Assume it is true for all i' < i. Now consider i' = i. 

Fi = PRcmainder(i ;l i _2, F_i) 

= P Remainder ( a^ 2 Pi-4 Pfl^ 3 Pi-1 F-_ 2 , 

Oi-l Pi-3 Pt-2 2 Pi-1 FU) 
= Oli-2 Pi~A Pt-3 3 Pi-3 Pt-2 2 ) 
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Pf-i 1 PRcmainder( J F 1 /_ 2 ,i^_ 1 ) 
= a< A-2 /3^T 1 PRemainder(i^'_ 2 , F[_ 1 ) 
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8.1 Introduction 

In this chapter, we focus our attention on real algebra and real geometry. 
We deal with algebraic problems with a formulation over real numbers, R 
(or more generally, over real closed fields). The underlying (real) geometry 
provides a rich set of mechanisms to describe such topological notions as 
"between," "above/below," "internal/external," since it can use the inher- 
ent order relation (<) of the real (or, real closed) field. As a result, the 
subject has found many applications in such practical areas as computer- 
aided manufacturing , computational geometry, computer vision, geometric 
theorem proving, robotics, and solid modeling, etc., and thus has generated 
a renewed interest. We concentrate on the following key ingredients of real 
algebra: Sturm theory, algebraic numbers, and semialgebraic geometry. 

We start our discussions with some preparation for the classical Sturm 's 
theorem, which permits us to determine the exact number of real roots of 
a real polynomial in an interval. 

Our starting point is with Artin and Schreier's theory of formally real 
fields developed in the late 20's. In particular, we focus our attention 
on the real closed fields, which are the formally real fields maximal under 
algebraic extension. Intuitively, real closed fields are "almost algebraically 
closed" as they are only missing and thus capture some of the most 

essential properties of the field of real numbers, R. [More formally, if K 
is a real closed field, then K (V - 1) is algebraically closed.] With these 
preparations, we move on to Sturm theory in a rather general setting. 

Next, we consider a particularly interesting subclass of real numbers, 
R: the field of real algebraic numbers, A. Both real algebraic numbers and 
integers are useful to computer scientists because they provide an effective 
way of representing infinite precision numbers that crop up in computa- 
tions involving an algebraic formulation. In many situations, where it is 
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paramount that the computed answers be at least topologically consistent 
(possibly incorrect) — a weaker requirement, which is nevertheless quite sat- 
isfactory in many situations — computing with algebraic numbers provides 
a practical solution. 

Finally, we study how building up on Sturm theory and algebraic num- 
ber representations one can devise a decision procedure for a wide class 
of geometric statements: the Tarski sentences. Roughly speaking, if such 
statements have a feasible real model, then they have a model that can be 
described by algebraic numbers and few auxiliary rational polynomials. Be- 
cause algebraic numbers can be finitely described, the corresponding finite 
model can be sought for in a bounded amount of time. 

Using these methods, one can consider the geometric objects defined 
by algebraic equations and inequations (i.e., semialgebraic sets), their de- 
compositions (e.g., cylindrical algebraic decomposition), various topological 
properties (e.g., connectivity), triangulations, stratifications, etc. We only 
deal with some of the very basic problems in this field. 

8.2 Real Closed Fields 

We begin with a few definitions. 

Definition 8.2.1 (Ordered Field) An ordered field K is a commutative 
field K together with a subset P, the set of positive elements, of K such 
that we obtain the following: 

1. 0(^P. 

2. If a G K, then either a € P, a = 0, or —a e P. 

3. P is closed under addition and multiplication. If a, b € P, then so 
are a + b and a b. □ 

Clearly, in a field K with a linear ordering >, one can identify P, the 
set of positive elements, as follows: 

• If a > 0, we say a is positive; P = {a e K : a > 0}. 

• If — a > 0, we say a is negative: N = {a e K : —a > 0}. 

Therefore, K = P U {0} U N. 

We can introduce an ordering in the ordered field K (or more precisely, 
(K, P)) by defining 

a > b if (a-b) e P. 
This ordering relation ">" is a strict linear ordering on the elements of K: 

a + c > b + c 
a c > b c 

(c) a > b, a > 0, b > b^ 1 > a^ 1 



(a) 



a > 



b => NceK) 



(b) 



a > b 



(v c e P) 
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Thus, we could have defined an ordered field in terms of a binary tran- 
sitive relation > as follows: 

1. Trichotomy: a = or a > or —a > 0. 

2. Closure Under Additions and Multiplications: a > and b > =>■ 
ab > and a + b > 0. 

In an ordered field, we can define various notions of intervals just as on 
the real line: 

• Closed Interval: 

[a, b] = {x e K : a<x< b}. 

• Open Interval: 

(a,b) = {x g K : a < x <b}. 

• Half- Open Intervals: 

(a, 6] = {x € K : a < x < b} and [a, b) = {x G K : a < x < b} 



Definition 8.2.2 (Absolute Value) 

is defined to be 




(a) \a + b\ < \a\ + \b\, 

(b) \ab\ = \a\ \b\. □ 



Absolute value of an element a g K 
if a > 0; 

if a < 0. 



Definition 8.2.3 (Sign Function) The sign of an clement aeJfis de- 
fined to be 

( +1, ifa>0; 
sgn(a) = < — 1, if a < 0; 

[ 0, if o = 0. 

(a) a = sgn(a) |a| and \a\ — sgn(a) a. 

(b) sgn(afe) = sgn(a) sgn(6). □ 

In an ordered field K with 1, 

-1< < 1, 

since if — 1 > then (—1) + (—I) 2 = > 0, which is impossible. 
In any ordered field K, 

a^0 => a 2 = (-a) 2 = |a| 2 > > -1. 
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Hence, 

1. V~lgK. 

2. If oti, a 2 , . .., a r are ^ 0, then ^ af > > —1, since 

E«? = Ek 2 i^|E a * 2 |> ' 

and the relation ^ of = has the only solution a, = 0, for all i. 
Additionally, we see that in an ordered field —1 cannot be expressed 
as a sum of squares. 

The discussion above leads to the following definition: 

Definition 8.2.4 (Formally Real Field) A field K is called formally 
real if the only relations in K of the form Y^i=i a i = ® are those for which 
every a, = 0. □ 

From the preceding discussions we conclude that 

Corollary 8.2.1 

1. Every ordered field is formally real. 

2. K is formally real if and only if ' — 1 is not a sum of squares of elements 
ofK. 

3. A formally real field is necessarily of characteristic zero. 
PROOF. 

(1) See the preceding discussion. 

(2) First note that if for some bj's, 

n 

4=1 

then the relation (&i) 2 H h (b n ) 2 + (l) 2 = is a nontrivial solution of 

the equation YH=i a i = 0' an d K IS n °t a formally real field. Conversely, 
if K is not a formally real field, then for some set of nonzero bi € K (i = 0, 
. . ., n), we have 

bl + b\ + ■ ■ ■ + bl = 0, 

or 

(h/b a ) 2 + --- + (b n /b ) 2 = -l. 

Clearly, bi/bo are defined and in K. 

(3) Note that in a field of characteristic p, we have 

I 2 + --- + 1 2 = 0. □ 

V v ' 

p-many 
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Definition 8.2.5 (Induced Ordering) If K' is subficld of an ordered 
field (K, P), then K' is ordered relative to P' = K' n P. We call this the 
induced ordering in if'. □ 

Definition 8.2.6 (Order Isomorphism) If (K,P) and (K',P') are any 
two ordered fields, then an isomorphism r\ of K into K' is called an order 
isomorphism if r)(P) C P'. This implies that 77(0) = 0, r)(N) C AT' and, if 
77 is surjective, then r/(P) = P' and ^(AT) = N'. □ 

Definition 8.2.7 (Archimedean Ordered Field) The ordering of a field 
is called Archimedean if for every field element a, there exists a natural 
number n > a, where n stands for 

s . ' 

n-many 

In this case there exists also a number — n < a for every a, and a fraction 
1/n < a for every positive a. 

Note that the ordering of the field of rational numbers, Q, is Archimedean. 
As an immediate corollary, we have: 

Corollary 8.2.2 For any two elements a < b in an Archimedean ordered 
field, K, there are infinitely many points between a and b. 

PROOF. 

First note that, in K, there is an infinite strictly increasing sequence of 
elements in K: 

1 < n\ < n 2 < n 3 < ■ ■ ■ . 
Let k be the smallest index such that (6 — a) > n^ 1 , then 

a < (b - n- 1 ) <{b- n-^) < (b - n^ 2 ) < • • • 
and each such element (b — n^ + -) is between a and b. □ 

Definition 8.2.8 (Real Closed Fields) We call an ordered field (K, P) 
real closed, if it has the following properties: 

1. Every positive element of K has a square root in K. 

2. Every polynomial f(x) € K[x] of odd degree has a root in K. □ 
An alternative definition for a real closed field K is the following: 

K is formally real and no proper algebraic extension of K is 
formally real. 

We will say more about this later. We state a fundamental result without 
proof. 
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Theorem 8.2.3 (Fundamental Theorem of Algebra) If K is a real 
closed field, then K{\/^1) is algebraically closed. □ 

An immediate corollary of the fundamental theorem of algebra is the fol- 
lowing: 

Corollary 8.2.4 The monic irreducible polynomials in K[x] (K = real 
closed) are either of degree one or two. □ 

Furthermore, if K is real closed, then its subfield of elements which are 
algebraic over Q (c R) is real closed. As before, in a real closed field, we 
write b > a for b — a G P. 

A classical example of a real closed field is, of course, R, the field of real 
numbers. 

Lemma 8.2.5 A degree two monic polynomial x 2 + ax + b G K[x] over a 
real closed field, K, is irreducible if and only if a 2 < 46. 

PROOF. 

Write the degree two monic polynomial in the following form: 



[Note that, by definition, c exists, since 46 — a 2 e P and K is real closed. 




The proof is by following the three possible cases: 
Case (1): 46 > a 2 . 




Also c 2 = 




Case (2): 46 = a 2 . 




x 2 + ax + b = 



reducible. 



Case (3): 46 < a 2 . 



46 < a 2 => 



a 2 -4b£P 




x 2 + ax + 6 = 



reducible. 



□ 
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Lemma 8.2.6 A real closed field has a unique ordering endowing it with 
the structure of an ordered field. That is, any automorphism of a real closed 
field is an order isomorphism. 



Consider a real closed field K, and let K 2 be the set of elements consisting 
of squares of nonzero elements of K. Now, consider (K, P), some arbitrary 
ordered field structure on K. 

We know that K 2 C P. Conversely, select an element b G P; by 
definition, for some a G K, (a ^ 0) a 2 — b. Hence, b E K 2 , and K 2 = P. 

The unique ordering is then given by >, with b > a if and only if 



It is now useful to go back to the alternative definition: 
Theorem 8.2.7 An ordered field K is real closed if and only if 

1. K is formally real, and 

2. no proper algebraic extension of K is formally real. 

PROOF. 

(=>) Since every ordered field is necessarily formally real, the first condi- 
tion is easily satisfied. 

We only need to consider extensions of the kind K(y^j), where x 2 — 7 
is an irreducible polynomial in K. Thus < —47, or 7 ^ K 2 , the set of 
squares of nonzero elements of K. Hence —7 € K 2 and —7 = a 2 . But then 



thus showing that K(y/j) is not formally real. 

(<=) In the converse direction, first consider the field K with the ordering 
defined by P = K 2 (where K 2 , as before, denotes the squares of the nonzero 
elements in K). There are essentially two objectives: First, to prove that 
(K, K 2 ) is an ordered field; second, to show that every polynomial equation 
of odd degree is solvable. We proceed in order. 

The only difficult part is to show that for every a e K (a / 0), cither 
a e K 2 or —a G K 2 . Other conditions are trivially satisfied since the sums 
of squares are squares and the products of squares are obviously squares. 
Suppose a is not a square, then x 2 — a is irreducible and K(y/a) is not 
formally real, i.e., 



PROOF. 



b-aeK 2 . □ 



in K(^y), 




-1 = 
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But then the last term should vanish, since otherwise y/a would be a zero 
of the following polynomial in K[x\: 



Thus 



EA 2 



G if 2 , 



as both numerator and denominator arc in K 2 . 

The second part is shown by contradiction. Consider the smallest odd 
degree irreducible polynomial f(x) G K[x). [Clearly deg(/) > 1.] Let £ be 
a root of /; then K(£) is not formally real: 

where deg(gi) < deg(/). Then we see that 

-l = Y,9i{x?+h{x)f{x), 

by virtue of the isomorphism between K{£) and K[x]/(f(x)). By examining 
the above identity, we see that h is of odd degree and deg(/i) < deg(/). 
Now substituting a root £' of h, in to the above equation, we get 

We conclude that h is irreducible and that our original choice of / leads to 
a contradiction. □ 

Theorem 8.2.8 Let K be a real closed field and f(x) G K\x\. If a, b E K 
(a < b) and f(a) f(b) < 0, then there exists a root of f(x) which lies between 
a and b, i.e., 

(3ce(M)) [/(c) = o . 

PROOF. 

Assume f(x) is monic. Then f(x) factors in K[x] as 



f(x) = (x- n)- • • (x - r m ) ■ g 1 (x) ■ ■ ■ g s (x), 



where each gi(x) 



CiX + di is, by an earlier corollary, an irreducible 



monic polynomial of degree 2 and 

(v^eif) [#(«)> o], 

i.e., the quadratic factors are always nonnegative. We know that 



( 



VKKm 



) 



a 7^ ri and b ^ r. 
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since f(a)f(b)^0. 

Let us now consider the effect of each root n on 

f(a) f(b) = Y\_( a — r i)(b — ri) x some nonnegative value, 



a < ri A b <Ti 
a > Ti A b > Ti 
a < n A b > Ti 



(a-n)(b-n) > 0, 
(a-ri)(b-ri) > 0, 
(a-ri){b-ri) < 0. 



This implies that if a root lies between a and b, then it contributes a 
negative sign to f(a) f(b); and if r t does not lie between a and 6, then it 
does not affect the sign of f(a) f(b). 

Hence /(a) f(b) < implies that there exist an odd number (and hence 
at least one) of roots of f(x) between a and b. □ 

Corollary 8.2.9 Let K be a real closed field and f(x) e K[x] such that 
/(c) > 0. Then it is possible to choose an interval [a, b] containing c such 
that 



(v«e [M]) [/( 



u) > 



□ 



Theorem 8.2.10 (Rolle's Theorem) Let K be a real closed field and 
f(x) e K[x]. If a, b G K (a < b) and f(a) = f(b) = 0, then 



(3ce(a,6)) [D(/)(c)=0 



where D denotes the formal differentiation operator. 

PROOF. 

Without loss of generality assume that a and b are two consecutive roots 
of the monic polynomial f(x), of respective multiplicities m and n: 



(x - a) m (x - b) n g(x). 
(x-a)™- 1 (x-bY^gix), 



Now 

D(f)(x) - 
where g{x) is given by 

g{x) = [m(x — b) + n(x — a)]g{x) + (x — a)(x — b)D(g){x). 

Now, note that 

S(a)g(b) = -mn(a - bfg(a)g(b) < 0, 

as by assumption g does not have a root in (a, b), and sgn(g(w)) is un- 
changed for all u e [a, b}. 

Now by our previous theorem: 

= . 
0. □ 



3ce(a,6) g(c) 



Hence D(f)(c) = (c- a) m - 1 (c - 6)™" 1 5(c) 
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8.3 Bounds on the Roots 



Given a polynomial /, we obtain bounds on its roots by showing that every 
root u must be in an interval u € (M l7 M 2 ), or equivalently 



We set out to find such Mi and M% as functions of the sizes of the coef- 
ficients. Note that then, \M\\ = |M 2 |, since if u is a zero of f(x), —u is a 
zero of f(x) = f(—x) and the sizes of / and / are same. Thus, we seek 
bounds of the kind |w| < M for the roots u of /. 

Theorem 8.3.1 Let K be an ordered field, and 

f(x) = x n + dn^x 71 - 1 + ■ ■ ■ + ao, 

a monic polynomial with coefficients in K . Let M and N denote the fal- 
lowings: 



M = M(f) = max(l,|a„_i| •+ 
N = N(f) = l + max(|a„_i| 



\-\ao\), 
l«o|)- 



1. If\u\ > M, then \f(u)\ > 0. 

2. If\u\ > N, then \f(u)\ > 0. 



PROOF. 

First note that 



(Vu) [|/(«)|>0 
We may only consider a root u of /, \u\ > 1. Note that 
f(u) = =4 



f(u) = u n + dn-m 71 - 1 + ■ ■ ■ + a = 
\u\ n < |a„_i| • It*!™- 1 H h |oo|- 



Thus, 



in-l 



(1) \u\ n < |a„_i| • |it| 
^ |u|"<M-|ur- 1 
=> \u\ < M. 



\ a o\ 



(2) H"<|a„_j 



in-l 



" *H h|a | 

u\ n < (N — 1) ■ H + 1) < 

u\ - 1< N - 1 
u\ < N. □ 



(N — 1) • 
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Hence, 

(v \u\ > M, \u\ > jv) [|/(u)| > O] . □ 
The corollary below follows: 
Corollary 8.3.2 (Cauchy's Inequality) Let 

f(x) = a n x n + a n - X x n ~ x H h a . 

be a polynomial over K , an ordered field. 

Then any nonzero root u of f must satisfy the followings: 

\ao\ ^ii^ +max(|a„_i|,...,|a |) 

(1) i — i 71 — i i — nr < M < i — i : 

|ao|+max(|a n |,...,|ai|) \a n \ 

min(/) I. min(/) + max(/) 

1 } min(/)+max(/) 1 1 min(/) 

where 

min(/) = min{|aj| : at ^ 0} and max(/) = max{|ai| : 7^ 0}. 



PROOF. 

This is a direct consequence of the preceding theorem. Assume, without 
loss of generality, that u is a nonzero root of f{x) 

f(x) = a n x n + a n -\x n ~ x H V a m x m , a m _i = • • • = a = 0. 



Then clearly, u is a root of 
a 

and u^ 1 is a root of 



a n x ~\~ a n —ix ~\~ ' ' ' ~\~ a>m: 



a m x n m + a m+1 x n m 1 + --- + a n . 
Using the preceding theorem: 

\a n \ + max(|q»_i|, . . . , \a m \) < min(/) + max(/) 
\a n \ min(/) 

and 

_ x |a m | +max(|a m+ i|,...,|q Tt |) min(/) + max(/) 

M II — ■ / r\ 

\a m \ min(/) 

As a result, we have 

\a m \ |o n |+max(|a„_i|,...,|fl m |) 
< M < : : ; 



\a m \ + max(|o„|, . . . , |a m +i|) 

and 



min(/) ^ < min(/) + max(f) ^ 



min(/) + max(.f ) min(/) 
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Using Cauchy's and Landau's inequalities, we can prove the following 
useful theorem: 

Theorem 8.3.3 Let f(x) e be an integral polynomial 
f(x) = a n x n + a„_ia; n_1 H h a . 

Let 

ll/lli = |o n | + |a n _i| + --- + |ao|, 
||/|| 2 = (a* + a 2 n _ 1 + --- + a 2 )^, and 
1 1 ./* 1 1 00 = max(|o n |,|a n _i|,...,|a |). 

Then all the real roots of f are in the intervals (— ||/||i, ||/||i), [— H/H2, 
H/H2] and (-1 - H/lloo, 1 + H/lloo). 

Also for every nonzero real root u of f, 

1 

' ' 1 + H/lloo" 

PROOF. 

(1) All the real roots of / are in the intervals (— ||/||i, ||/||i)- 
Since f{x) is integral (i.e., if a, 7^ 0, then |aj| > 1), 

ll/lli = |o n | + |o„_i| + --- + |ao| 
> l + |a n _i|H h|a | 





a n -i 


+ • 


• + 


a 






) 








a n 





which is no other than the M of Theorem 8.3.1. 

(2) All the real roots of / are in the intervals (—1 — ||/||oo, 1 + ||/||oo)- 
As before, we use the fact that, if dj 7^ 0, then \ai\ > 1. 



1 + H/lloo = l+max(|a n |,|o n _i|,...,|a |) 
> 1 + max(l, |a n _i|, . . . , |a |) 



( 


a-n-i 




a 


) 




a n 


, . . . , 


a n 





which equals the N of Theorem 8.3.1. 

(3) All the real roots of / are in the intervals [— H/H2, H/lh]- This is a 
direct consequence of Landau's inequality, which states that if ai, «2, • • •, 
at are roots of /, then 



|o„| JJmax(l,ai)< H/lla. □ 
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8.4 Sturm's Theorem 

Consider a polynomial f(x) e K[x\ where K is a real closed field. A 
classical technique due to Sturm shows how to compute the real zeros of 
f(x) in an interval [a, b]. The recipe is as follows: 

f. First compute a special sequence of polynomials STURM (/) = (ho(x) = 
f(x), hi(x), . . ., h s (x)), which will be called a Sturm sequence. 

2. Next compute the "variations in sign" for the sequences STURM(/)(a) = 
(ho(a), hi (a), . . ., h s (a)} and sturm (/)(&) = (ho(b), hi(b), . . ., 
h s (b)) — denoted, respectively, by Var a (STURM(/)) and Varj, (sturm ( / ) ) . 

3. Then 

# real zeros of f(x) in (a, b) — Var a (sturm (/) ) — Var& (STURM (/) ) . 

However, these notions need to be formalized further. In this section, 
we shall start with a slightly general version of the statement above: the 
Sturm- Tar ski theorem. Later on we shall study some generalizations to 
higher dimensions. 

Definition 8.4.1 (Variations in Sign) If c = (ci, . . ., c m ) is a finite 
sequence of nonzero elements of a real closed field K, then we define the 
number of variations in sign of c to be 



In general, If c = (ci, . . ., c m ) is an arbitrary sequence of elements of K, 
then we define the number of variations in sign of c, denoted Var(c), to be 
the number of variations in the sign of the abbreviated sequence abb(c), 
which is obtained by omitting the zeros in c. 

Thus, Var(c) is the number of times the entries of c change sign when 
scanned sequentially from left to right. □ 

For example, (1, 0, 0, 2, —1, 0, 3, 4, —2) has three variations in sign. 
Note that for any nonzero a e K 



where a (ci, . . ., c m ) — (ac\, . . ., ac m ). Similarly, if c, • c i+ i < then for 
any a e K 

Var((ci, . . . ,Cj,c i+ i, . . . ,c m )) = Var((ci, . . . , Cj, a, Cj+i, . . . , Cm)). 

For a vector of polynomials F = (fi(x), . . ., f m {x)) € K[x] m and a field 
element a € K, we write Var a (F) for Var(F(a)): 



i : 1 < i < m and Cj • C{ + i < 




Var(c) = Var(ac), 



Var„(F) = Var(F(a)) = Var«/i(o), . . . , f m {a))). 
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Definition 8.4.2 (Standard Sturm Sequence) The standard Sturm se- 
quence (or, canonical Sturm sequence) of a pair of polynomials f(x) and 
g(x) € K[x] (K = & field) is 

STURM (f,g) = (ho(x),h!(x), . . . ,h s (xj), 



where 

ho{x) = f(x) 

hi{x) = g(x) 

h (x) = q\{x) hi(x) - h 2 (x), deg(/i 2 ) < deg(fti) 

fti_i(ar) = g^z) hi(x) - h i+1 (x), deg(/i i+ i) < deg(ftj) 

ft s _i(a;) = g s (x) /i s (x). □ 

Note that the standard sequence is termwise similar to the polynomial 
remainder sequence, except that we take the negation of the remainder 
at each step, i.e., it is based on the sequence (1, 1, —1, —1, ...). Also 
note that for any f(x) and g(x), the last element in their standard Sturm 
sequence, STURM (/, g), is in fact their GCD (up to a sign); in particular, 
h s (x) | hi(x) (0 < i < s). In this case, we may consider their "suppressed" 
Sturm sequence, 

- _ Ux)_ 

M j ~ h s ( x y 

Note, h s (x) = 1. Furthermore, for any i > 0, we also have 
hi-i(x) = q l (x) hi(x) - h l+ i(x). 



Lemma 8.4.1 Let f(x) and g{x) <G K[x], K = a real closed field, be a pair 
of polynomials with a standard Sturm sequence, 

Sturm (/, g) = H = (ho(x),hi(x),...,h s (x)y 

Let [a,b] be an interval (a < b) not containing a zero of f(x) = ho(x). 
Then 

Var a (sturm (f,g)) - Var fc (STURM (f,g)) = 0. 

PROOF. 

Let us consider an arbitrary subinterval [a', b'} C [a, b] such that it contains 
at most one zero of the suppressed sequence 

H = (h (x),h-i(x), h s (x)). 
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It suffices to show that 

Var a /(STURM(/, g)) - Vary (sturm(/, g)) = 0. 

Since the interval does not contain a zero of ho, nor does it contain a zero 
of h - 

Now, if the interval does not contain a zero of any element of H, then 
hi(a') hi{b') > for all i and 

Var a ,(H) - Var 6 ,(#) = 0. 

Alternatively, assume that the interval contains a zero of hi g H, i.e., 
for some c g [a', b'\, ftj(c) = (i > 0). Then 

ftj_i(c) = g»(c) ftj(c) - ft i+ i(c) = -ft i+ i(c) 
^ ^i_i(c) fti+i(c) = -/i-+i(c) < 0. 

Since /i i+ i(c) = would imply that hi(c) = h i+ i(c) = ■■■ = h s (c) = 0, 
contradicting the fact that h s = 1. Thus, 

ftj_i(c) • ft»+i(c) < 
=> hi-i(a') ■ h i+1 (a') < A h^b') ■ h i+1 (b') < 0, 

which results in 

Var a ,(H) = \ & v{h {a'),...M-i{a')M+i{a'),...,h s {a')) 
= Var(/i (6'), ■ • • , hi-i(b'), h i+ i{b'), . . . , h s (b')) 
= Va,r b ,(H). 

Thus, in either case 

Var a ,(£)-Var fc ,(i?) = 0. 

Thus 

Var„/ (sturm (/, g)) - Vary (sturm(/, g)) 
= Var a ,(M«') H) - Vta v (h,(l/) H) 

= 0. 

Clearly, the interval [a, b] can be partitioned into finitely many subintervals 
such that over each subinterval there is no net change in the variation, thus 
proving that there is no net change over the entire interval [a, b], either. □ 

Let D:K[x] — > K[x] be the (formal) derivative map: 

D : x n ^n- x n ' x 

: a i ► , a g K. 

Assume that f(x) e K[x] is an arbitrary polynomial over a real closed 
field K and f'(x) = D(f(x)) its formal derivative with respect to x. 
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Lemma 8.4.2 Let f(x) and g(x) <G K[x], K = a real closed field. Consider 
a standard Sturm sequence of the polynomials f(x) and f'(x) g(x), 

sturm(/, f'g) = H = (h (x),h 1 (x),...,h s (x)y 

Let [a, b] be an interval (a < b) containing exactly one zero c G (a, b) of 
f(x) = ho(x). Then 

Var a (STURM (/, f'g)) - Var 6 (STURM(/, f'g)) = sgn(.g(c)). 

PROOF. 

Without loss of generality, we may assume that none of the hi (x) 's vanish 
in either of the half-open intervals: [a, c) and (c, b]. 
Let us write f(x) and g(x) as follows: 

f(x) = (x - c) r (fr(x) and g{x) = (x - c) s ip(x), r > 0, s > 0, 

where, by assumption, we have <p(c) ^ and ip(c) ^ 0. 
Then, 

f'(x) = (x- cf- 1 [r4>{x) + (x - c)<f/(x)] . 

Thus 

f(x) f'(x)g(x) = (x- c) 2 ^- 1 [r^{x)^{x) + (x - c)<f>(x)<f! (x)1>(xj\ . 
Now, we are ready to consider each of the cases: 

• Case 1 : s = 0, i.e., g(c) ^ 0. 
In that case, 

f(x) f'(x)g(x) = {x- c) 2r - 1 [r(j) 2 (x)i>{x) + (x - c)<f>(x)<f>' (x)i/;(x)] , 

an odd function of x in the neighborhood of c e [a, 6]. If g(c) > 0, 
then in the neighborhood of c, 

f{x)f'(x)g(x)^(x-c^- 1 [k + + e] 1 

where k + = r(f> 2 (c)ip(c) > 0. Thus to the left of c, f(x) and f'{x)g{x) 
have opposite signs and to the right same signs, implying a loss of 
sign as one moves from left to right past c: 

Var a (STURM(/, f'g)) - Var 6 (sturm(/, f'g)) = +1. 

Similarly, if g(c) < 0, then in the neighborhood of c, 

f(x)f(x)g(x) = (x-cr- 1 lk-+e], 

where k- = rcfi 2 (c)f/»(c) < 0. Thus to the left of c f(x) and f'(x)g(x) 
have same signs and to the right opposite signs, implying a gain of 
sign: 

Var a (STURM(/, f'g)) - Var 6 (STURM(/, f'g)) = -1. 
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• Case 2 : s > 0, i.e., g(c) = 0. 

1. SubCase 2A : s = 1. 
In this case 

/io(a;) = / (x) = (x — c) r <p(x), and 
hi(x) = f{x)g{x) = (x - c) r [r<f>(x)i/>(x) + (x - c)cj>' (x)^(x)}. 

Thus 

^iW = = r<l>{x)1>{x) + (x-c)<lt{x)1>{x). 

[x c) 

Hence the suppressed sequence has no zero in the interval [a, b], 
and thus the suppressed sequence undergoes no net variation of 
signs. Arguing as in the previous lemma, we have: 

Var Q (sTURM(/, f'g)) - Var b (STURM(/, f'g)) = 0. 

2. SubCase 2B : s > 1. 

In this case deg(/i ) = dcg(/) < deg(f'g) = deg(h\); thus, 

ti2(x) = -h (x), 

i.e., the first and third entry in the sequence have exactly the op- 
posite signs. Again considering the suppressed sequence, we see 
that the suppressed sequence (and hence the original sequence) 
suffers zero net variation of signs. Hence: 

Var a (STURM(/, f'g)) - Var 6 (STURM(/, f'g)) = 0. □ 

Theorem 8.4.3 (General Sturm- Tarski Theorem) Letf(x) andg(x) 
be two polynomials with coefficients in a real closed field K and let 



STURM (/,/' 5 ) =^ h (x) = f(x), 

hi{x) = f'(x)g(x), 
h 2 (x), 



h s (x) 

where hi 's are related by the following relations (i > 0): 

hi-i{x) = qi(x) hi(x) - h i+1 (x), deg(/i i+ i) < deg(/ij). 
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Then for any interval [a, b] C K (a < b): 

b 



Var 



STURM(/, f'g) 



c f 



g>0 



c f 



g<0 



where 



Var 



STURM {f, f'g) 



Var a (sturm (/, f'g)) - Var 6 (STURM(/, f'g)). 



and Cf[P]a counts the number of distinct roots (g K , and without counting 
multiplicity) of f in the interval (a, b) C K at which the predicate P holds. 



PROOF. 

Take all the roots of all the polynomials /i 3 (x)'s in the Sturm sequence, and 
decompose the interval [a, b] into finitely many subintervals each containing 
at most one of these roots. 

The rest follows from the preceding two lemmas, since 



Var 



STURM {f, f'g) 



s S n (ff( c ))- n 

c€(o,6), /(c)=0 



Corollary 8.4.4 Let f(x) be a polynomial with coefficients in a real closed 
field K. 

f(x) = a n x n + a„_ia;™ _1 H h a . 

Then 

1. For any interval [a, b] C K (a < b): 

b 



Var 



STURM(/, /') 



= ^distinct roots € K of f in the interval (a,b). 

2. Let L e K be such that all the roots of f(x) are in the interval (-L, 
+L); e.g., 

_ \a n \ + max(|a»_i|, . . . , |a |) 
\a n \ 

Then the total number of distinct roots of f in K is given by 



Var 



STURM(/, /') 



Var_i(sTURM(/, /'))-Var +L (sturm (/, /')). 



PROOF. 

(1) The first part is a corollary of the preceding theorem, with g taken to 
be the constant positive function 1. 

(2) The second part follows from the first, once we observe that all the 
roots of / lie in the interval [— L, +L\. □ 
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Corollary 8.4.5 Let f(x) and g(x) be two polynomials with coefficients 
a real closed field K, and assume that 



f(x) and f'(x)g(x) are relatively prime. 



Then 



Var 



STURM(/, 5 ) 



-lb 



= c f 



1 b 



{f'g) > -c f (f'g) < 



i b 



□ 



Corollary 8.4.6 Let f(x) and g(x) be two polynomials with coefficients in 
a real closed field K. For any interval [a, b] C K (a < b), we have 



Cf 





b 




b 






g = 


+ c f 

a 


g>0 


a 


+ Cf 


g<0 








b 








Cf 


g>0 


a 


- Cf 


g<0 








b 








Cf 


g>0 


a 


+ Cf 


g<0 



= Var 



= Var 



sturm(/, /') 



Sturm(/, f'g) 



Var 



STURM (/, f'g 2 ) 



Thus the previous corollary can be expressed as follows, 



1 1 1 
1 -1 
1 1 

or equivalcntly: 



c f 



.9 = 



9 > 



3<0 



n 6 



Var 
Var 
Var 



STURM(/, /') 



STURM {f, f'g) 



STURM (/, f'g 2 ) 






-1 


1 


1 


2 


2 


1 


1 


2 


2 



Var 



Var 



Var 



STURM (/,/') 
STURM (/, f'g) 

- a 

sturm (/, f'g 2 )] b a 









b - 




c f 


3 = 










a 








b 




c f 


g>0 










a 








b 




. Cf 


g<o 


a - 



□ 



8.5 Real Algebraic Numbers 

In this section, we study how real algebraic numbers may be described and 
manipulated. We shall introduce some machinery for this purpose, i.e., 
root separation and Thorn's lemma. 
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8.5.1 Real Algebraic Number Field 

Consider a field E. Let F be subficld in E. An element u E E is said to 
be algebraic over F if for some nonzero polynomial f(x) € ^[z], f(u) = 0; 
otherwise, it is transcendental over _F. 

Similarly, let S be a subring of E. An element ti e £ is said to be 
integral over S if for some monic polynomial f(x) £ S[x], f(u) = 0. 

For example, if we take E — C, and F = Q, then the elements of C 
that are algebraic over Q are the algebraic numbers; they are simply the 
algebraic closure of Q: Q. Similarly, if we take S = Z, then the elements 
of C that are integral over Z are the algebraic integers. 

Other useful examples are obtained by taking E = M, F = Q and S = Z; 
they give rise to the real algebraic numbers and real algebraic integers — 
topics of this section; they are in a very real sense a significant fragment of 
"computable numbers" and thus very important. 

Definition 8.5.1 (Real Algebraic Number) A real number is said to 
be a real algebraic number if it is a root of a univariate polynomial f(x) € 
Z[r\] with integer coefficients. □ 

Example 8.5.2 Some examples of real algebraic numbers: 

1. All integers: 

n € Z is a root of x — n = 0. 

2. All rational numbers: 

P 

a = - G Q (q 7^ 0) is a root of qx — p = 0. 

3. All real radicals of rational numbers: 

P = \f¥h e R > 0, q > 0) is a root of qx n - p = 0. □ 

Definition 8.5.3 (Real Algebraic Integer) A real number is said to be 
a real algebraic integer if it is a root of a univariate monic polynomial 
f(x) G Z[^] with integer coefficients. □ 

Example 8.5.4 The golden ratio, 

1 + V5 
4 2 , 

is an algebraic integer, since £ is a root of the monic polynomial 

x 2 - x - 1. □ 
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Lemma 8.5.1 Every real algebraic number can be expressed as a real al- 
gebraic integer divided by an integer. 

(This is a corollary of the following general theorem; the proof 
given here is the same as the general proof, mutatis mutandis. 
Every algebraic number is a ratio of an algebraic integer and an 
integer.) 



PROOF. 

Consider a real algebraic number £. By definition, 

£ = a real algebraic number 

<=> £ = real root of a polynomial 

f(x) = a n x n + a„_ia;" _1 H h a E Z[r<\, D K ^ K 

Now, if we multiply f{x) by a" , we have 

= < _1 a„ x n + a^an-x x^ 1 + ■■■ + a™" 1 ^ 
= (a n x) n + a n - 1 (a n x) n ~ 1 H h aoa" -1 - 

Clearly, a„£ is a real root of the polynomial 

g(y) =y n + a n -iy n ^ + ■■■ + aoa^ 1 € Z[rx], 

as g(a n £_) = = 0. Thus a„£ a real algebraic integer, for some 

a n e Z\{F}. □ 

Lemma 8.5.2 

7. 7/ a, /3 are rea^ algebraic numbers, then so are —a, a^ 1 (a ^ 0), 
a + f3, and a ■ (3. 

2. If a, (3 are real algebraic integers, then so are —a, a + (3, and a ■ (3. 
PROOF. 

1. (a) If a is a real algebraic number (defined as a real root of f{x) e 
Z[^]), then —a is also a real algebraic number. 

a = a real root of f{x) = a n x n + a„_ia;™ _1 + • • • + a 

— a = a real root of a n {— x) n + a n _i(— x) n ~ l + • • • + a 
—a = a real root of 

(-l) n a n x n + (-l) n - 1 a n -i(-x) n - 1 + ---+a Q e Z[a] 

(b) If a is a real algebraic integer, then —a is also a real algebraic 
integer. 
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2. If a is a nonzero real algebraic number (denned as a real root of 
/(x) € Z[^]), then 1/a is also a real algebraic number. 

a = a real root of /(x) — a n x n + a n -\x n ~ x + • • • + a 
<=> 1/a — a real root of 

x n a n (^j +x n a n - 1 (^j + --- + x"a 

1/a = a real root of a x n + aix™ -1 + ■ ■ ■ + a n e Z[^]. 

Note that, if a is a nonzero real algebraic integer, then 1/a is a real 
algebraic number, but not necessarily a real algebraic integer. 

3. (a) If a and (3 are real algebraic numbers (defined as real roots of 

f{x) and g(x) e Z[^], respectively), then a + (3 is also a real 
algebraic number, defined as a real root of 

Resultant,, (.f(x - y), g(y)). 

a = a real root of f(x) and /? = a real root of g(x) 
x — a = a real root of f(x — y) € (Z[-<n])[rv], 
/? = a real root of g(y) e (Z[W])[rv] and 
.x — a = /3 = a common real root of f(x — y) and g(y) 
44> x = a + (3 = a real root of 

Resultant y (/(x — y),g(y)) e Z[*r\]. 

(b) If a and /3 are real algebraic integers, then a + (3 is also a real 
algebraic integer. 

4. (a) If a and /3 are real algebraic numbers (defined as real roots of 

f(x) and g{x) <G Z[^], respectively), then a [3 is also a real 
algebraic number, defined as a real root of 

Resultant,, (y m f , , where to = deg(/). 

a = a real root of /(x) and /? = a real root of g(x) 
<S> - = a real root of t/ m /( - | e (ZM)[rvl, 

/3 = a real root of g{y) € (Z[^])[r>] and 

— = /3 = a common real root of y m f { — ) and o(«) 

" \yj 

x = a (3 = a real root of 

Resultant,, (y m f(^j, g(y)J g ZH. 
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(b) If a and [3 are real algebraic integers, then a/3 is also a real 
algebraic integer. □ 

Corollary 8.5.3 

1. The real algebraic integers form a ring. 

2. The real algebraic numbers form a field, denoted by A. D 

Since an algebraic number a, by definition, is a root of a nonzero poly- 
nomial f(x) over Z, we may say that / is en's polynomial. Additionally, 
if / is of minimal degree among all such polynomials, then we say that it 
is a's minimal polynomial. The degree of a nonzero algebraic number is 
the degree of its minimal polynomial; and by convention, the degree of is 
— oo. It is not hard to see that an algebraic number has a unique minimal 
polynomial modulo associativity; that is, if f(x) and g(x) are two minimal 
polynomials of an algebraic number a, then f(x) w g(x). 

If we further assume that a is a real algebraic number, then we can talk 
about its minimal polynomial and degree just as before. 

Theorem 8.5.4 The field of real algebraic numbers, A, is an Archimedean 
real closed field. 

PROOF. 

Since Acl and since R itself is an ordered field, the induced ordering on 
A defines the unique ordering. 

A is Archimedean: Consider a real algebraic number a, defined by its 
minimal polynomial f(x) £ Z[W|: 

f(x) = a n x n + a„_ia; n_1 H h o , 

and let N = 1 + max(|a„_i|, . . ., |a |) € Z. Then a < N. 

A is real closed: Clearly every positive real algebraic number a (defined 
by its minimal polynomial f{x) e Z[W|) has a square root \fa € A defined 
by a polynomial f(x 2 ) e "L[r\\. Also if f{x) G A[^] is a polynomial of odd 
degree, then as its complex roots appear in pair, it must have at least one 
real root; it is clear that this root is in A. D 

8.5.2 Root Separation, Thorn's Lemma and 
Representation 

Given a real algebraic number a, we will see that it can be finitely rep- 
resented by its polynomial and some additional information that identifies 
the root, if a's polynomial has more than one real root. 

If we want a succinct representation, then we must represent a by its 
minimal polynomial or simply a polynomial of sufficiently small degree 
(e.g., by asking that its polynomial is square- free) . In many cases, if we re- 
quire that even the intermediate computations be performed with succinct 
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representations, then the cost of the computation may become prohibitive, 
as we will need to perform polynomial factorization over Z at each step. 
Thus the prudent choice seems to be to represent the inputs and outputs 
succinctly, while adopting a more flexible representation for intermediate 
computation. 

Now coming back to the component of the representation that identifies 
the root, we have essentially three choices: order (where we assume the real 
roots are indexed from left to right), sign (by a vector of signs) and interval 
(an interval [a, b] C M that contains exactly one root). Again the choice 
may be predicated by the succinctness, the model of computation, and the 
application. 

Before we go into the details, we shall discuss some of the necessary tech- 
nical background: namely, root separation, Fourier sequence and Thorn's 
lemma. 

Root Separation 

In this section, we shall study the distribution of the real roots of an 
integral polynomial f(x) G Z[^]. In particular, we need to determine how 
small the distance between a pair of distinct real roots may be as some 
function of the size of their polynomial. Using these bounds, we will be 
able to construct an interval [a, b] (a, b e Q) containing exactly one real 
root of an integral polynomial /(x), i.e., an interval that can isolate a real 
root of f(x). 

We keep our treatment general by taking f(x) to be an arbitrary poly- 
nomial (not just square- free polynomials). Other bounds in the literature 
include cases (1) where f{x) may be a rational complex polynomial or a 
Gaussian polynomial; (2) where f(x) is square-free or irreducible; or (3) 
when we consider complex roots of f(x). For our purpose, it is sufficient to 
deal with the separation among the real roots of an integral polynomial. 

Definition 8.5.5 (Real Root Separation) If the distinct real roots of 
f(x) e Z[r<\ are a u ...,a t (l> 2), 

a± < C(2 < ■ ■ ■ < Cil, 

then define the separation 1 of / to be 

Separation(/) = min (aj — an). 

l<i<j<l 

If / has less than two distinct roots, then Separation(/) = oo. □ 
The following bound is due to S.M. Rump: 



Or more exactly, real separation. 
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Theorem 8.5.5 (Rump's Bound) Let f(x) G be an integral poly- 

nomial as follows: 

f[x) — a n x n + a n ^\x n ^ 1 H ha , a, e Z. 

Then 2 , 

Separationf f) > — , ... .„ . 

y->> n n+l( 1 + || / || 1 )2n 



PROOF. 

Let h G M. be such that, for any arbitrary real root, a, of /, the polynomial 
/ remains nonzero through out the interval (a, a + h). Then, clearly 

Separation(/) > h. 

Using the intermediate value theorem (see Problem 8.2), we have 

f(a + h) = f(a) + hf'(n), for some \x G (a, a + h). 

Since f{a) = 0, we have 

u _ \.f(a + h)\ 
|/'0*)l ' 

Thus we can obtain our bounds by choosing h such that |/(a + h)\ is 
"fairly large" and |/'(/i)|, "fairly small," for all pL G (a, a + h). Thus the 
polynomial needs enough space to go from a "fairly large" value to at a 
"fairly small" rate. 

Let [3 be a real root of f(x) such that [3 is immediately to the right of 
a. Then there are following two cases to consider: 

• Case 1: \a\ < 1 < \[3\. 

We consider the situation — 1 < a < 1 < f3, as we can always achieve 
this, by replacing f(x) by f(—x), if necessary. Take a + h = 1. Then 

1. \f(a + h)\ >1, since /(l) ^ 0. 

2. Since fi < 1, we have 

|/'(M)I < |na n | + |(n-l)a n _i| + --- + |ai| < n||/||i. 

Thus 

1 1 
Separation(J) > h > > 



nll/Hi n™+i(l + ||/||i) 2 "' 



2 Note: A tighter bound is also given by Rump: 

2\/2 

Separation(/) > w „ /2+1(1 + | 
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For the next case, we need the following technical lemma: 

Lemma 8.5.6 Let f(x) € Z[-r>] be an integral polynomial as in the theo- 
rem. If 7 satisfies 

/'( 7 ) = 0, but /( 7 )^0, 

then 

1/(7)1 > 1 



^(1 + H/lli) 2 "- 1 ' 

PROOF. 

Consider the polynomials /'(x) and f(x,y) = f(x) — y. Since 7 is a zero 
of f'(x) and (7, /( 7 )} is a zero of /(x, y), we see that /( 7 ) 7^ is a root of 

R(y) = Resultant^ (/'(x), f(x,y)). 

Using the Hadamard- Collins-Horowitz inequality theorem (with 1-norm) 
(see Problem 8.11), we get 

Halloo < (ll/lli + l)"- 1 (nil/Hi)", 

and 

1 + Wloo < ^(l + H/Hx) 2 "- 1 . 

Since /( 7 ) is a nonzero root of the integral polynomial R(y), we have 

1/(7)1 > 1 + Woc > ^(1 + H/lll) 2 "- 1 ' D 

(End of Lemma) 



Case 2: \a\, \(3\ < 1. 

By Rolle's theorem there is a 7 <G (a, /?), where /'( 7 ) = 0. Take 
a + h = "f. Then 

1. By Lemma 8.5.6, 

1 



l/(° + *)l = 1/(7)1 > ^(i + ii/ii^-i- 
2. As before, since ^ < 1, we have 

|/'(M)l<n||/||i. 

Thus 

Separation(/) > h 

1 

> 



> 



^(i + ll/lli) 2 "- 1 )^!!/!!!) 
1 



^ +1 (i + ll/lli) 2n ' 
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Note: The remaining case \a\, \0\ > 1, requires no further justifica- 
tion as the following argument shows: Consider the polynomial x n f(l/x). 
Then a -1 and f3~ 1 are two consecutive roots of the new polynomial of same 
"size" as f(x); {a' 1 ], I/?" 1 ! < 1; and 

0-a > ^ = a^-0~\ 
a 

But using case 2, we get 

Separation^) > - a > ± . □ 



Let r = p/5 6 Q be a rational number (p, q e Z). We define size of r 

as 

size(r) = \p\ + \q\. 

Theorem 8.5.7 Let f(x) e "L[r\\ be an integral polynomial of degree n. 
Then between any two real roots of f(x) there is a rational number r of size 

size(r) <2-n n+1 (l + ||/||i) 2n+1 . 

PROOF. 

Consider two consecutive roots of f(x): a < {3 . Let QeZbe 

Q = n"+ 1 (l + ||/|| 1 ) 2 " 
Consider all the rational numbers with Q in the denominator: 

i i-l 2 112 i - 1 i 

■ ■ ■ ' ~Q' ~~Q~' ' • ' ' " Q' Q' Q' • ' • ' ~Q~' Q' ' ' ' 

Since 

0-a>l/Q, 

for some P, 

a < ^ < P < Wfh, 



since is a root of /. 
Hence 



\P\ < n n+1 {\ + 11 ^ll-^ 2 " +1 



Thus the size of r = P/Q is 

size(r) < 2 • + ||/||i) 2 " +1 - □ 

Definition 8.5.6 (Isolating Interval) An isolating interval for an inte- 
gral polynomial f(x) £ is an interval [a, b] (a, b € Q), in which there 
is precisely one real root of the polynomial. □ 
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By the preceding theorem, we see that for every real root of an integral 
polynomial, we can find an isolating interval. Furthermore, 

Corollary 8.5.8 Every isolating interval of a degree n integral polynomial 
f(x) G Jj[r\] is of the form [a, b], (a, b <E Q) and size(a) and size(6) are no 
larger than 

2-n" +1 (l + ||/||i) 2n+1 . □ 

Hence every isolating interval can be represented with 0(n(lg n +/?(/))) 
bits, where (3(f) — 0(lg ||/||i) is the bit-complexity of the polynomial f(x) 
in any reasonable binary encoding. An important related computational 
problem is the so-called root isolation problem, where we are asked to com- 
pute an isolating interval of an integral polynomial f(x). 

RootIsolation(/(x)) 

Input: f{x) e ZM}; dog(/) = n. 

Output: An isolating interval [a, b] (a, b £ Q), 
containing a real root of f(x). 

Let S — STURM (/, /'), be a Sturm sequence; 
a:=-||/||i; b:=ll/l|i; 

if Vara(S') = Vaxb(S) then return failure; 

while V&r a (S) - Var 6 (S) > 1 loop 
c:= (o + 6)/2; 

if Var a (S) > Var c (5') then b := c else a := c 
end{if }; 
end{loop }; 

return [a, b]; 

end{ROOTlSOLATION} D 

Note that each polynomial of the Sturm sequence can be evaluated by 
0(n) arithmetic operations, and since there are at most n polynomials in 
the sequence, each successive refinement of the interval [a, b] (by binary 
division) takes 0(n 2 ) time. Thus the time complexity of root isolation is 



0(n 2 ) ■ O ( lg 
which is simplified to 



( n/h \ 

\ Separation(/) J 



OU»lr' 211/111 



n- n - 1 (l + \\f\\i)- 2n / 
= 0(n 2 lg(2n"+ 1 (l + ||/|| 1 ) 2 "+ 1 )) 
= 0(n 3 (lgn + /?(/))), 



Section 8.5 



Real Algebraic Numbers 



325 



where /?(/), as before, is the bit-complexity of the polynomial f(x). 

Fourier Sequence and Thorn's Lemma 

Definition 8.5.7 (Fourier Sequence) Let f(x) G K[^] be a real uni- 
variate polynomial of degree n. Its Fourier sequence is defined to be the 
following sequence of polynomials: 

FOURIER(/) =( fW(x) = f(x), 

fW(x) = f'(x), 
f {2 \x), 

f in) (x) ), 

where /W denotes the i th derivative of / with respect to x. □ 

Note that fourier(/') is a suffix of fourier(/) of length n; in general, 
fourier(/W) is a suffix of fourier(/) of length n — i + l. 

Lemma 8.5.9 (Little Thorn's Lemma) Let f(x) £ M.[-r\] be a real uni- 
variate polynomial of degree n. Given a sign sequence s: 

s = (s , Si, S2, . . . , s n ), 

we define the sign-invariant region of K determined by s with respect to 
fourier(/) as follows: 

R(s) = {C e R : sgn(U^ (0) = ~n, for all D = K, . . . , X }. 

Then every nonempty R(s) must be connected, i.e., consist of a single in- 
terval. 

PROOF. 

The proof is by induction on n. The base case (when n = 0) is trivial, 
since, in this case, either R(s) = R (if sgn(/(x)) = s ) or R(s) = (if 
sgn(/(z)) ^ s ). 

Consider the induction case when n > 0. By the inductive hypothesis, 
we know that the sign-invariant region determined by s', 

s' = (si, S 2 , . . . , S n ), 

with respect to fourier(/') is cither or a single interval. If it is empty, 
then there is nothing to prove. Thus we may assume that 

R(s') + 0. 
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Now, let us enumerate the distinct real roots of f(x): 

6 < 6 <•••< Cm- 
Note that if R(s') consists of more than one interval, then a subsequence 
of real roots 

&<&+i<---<0> j-i>i 
must lie in the interval R(s'). Now, since there are at least two roots £j and 
€ R(s'), then, by Rolle's theorem, for some £' € -R(s') < £' < i ) , 
/'(£') = 0. Thus, 

(V^e [/'(*) - 0] => f'(x) = => deg(/) - 0, 

a contradiction. □ 

Corollary 8.5.10 Consider two real roots £ and (, of a real univariate 
polynomial f{x) G R[^] of positive degree n > 0. T/ien £ = £, i/, /or some 
< to < n, ffce following conditions hold: 

/M(0 = /M(C) = 0j; 
sgn(/< m+1 >(0 = sgn(/( m+1 )(C), 

sgn(/^(0 = sgn(/(")(C). 

5" = (0,sgn(.f( m+1 )(C),...,sgn(/(")(C)), 

and R(s") be the sign-invariant region determined by s" with respect to 
FOURiER(/ (m) ). Since £ and ( e i?(s"), wc scc that i?(s") is a nonempty 
interval, over which /( m ) vanishes. Hence /( m ) is identically zero. But this 
would contradict the fact that deg(/) = n > to > 0. D 

Let us dehne sgn^ (fourier(J) ) to be the sign sequence obtained by 
evaluating the polynomials of fourier(J) at £: 

Sgn ? (FOURIER(/)) 

= (sgn(/(0), sgn(/'(0), sgn(/( 2 )(0), . . . ,sgn(/W(0) 

As an immediate corollary of Thorn's lemma, we have: 

Corollary 8.5.11 Let £ and £ &e two real roots of a real univariate poly- 
nomial f(x) € R[W] of positive degree n > 0. TTiera £ = £ 7 i/ ifte following 
condition holds: 

Sgn^(FOURIER(/')) = Sgn c (FOURIER(/')). □ 



PROOF. 

Let 
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Representation of Real Algebraic Numbers 

A real algebraic number a G A can be represented by its polynomial 
f{x) £ an integral polynomial with a as a root, and additional 

information identifying this particular root. Let 

f(x) = a n x n + a„_i x n ~ x H V a , deg(/) = n, 

and assume that the distinct real roots of f(x) have been enumerated as 
follows: 

a\ < 0L2 < ■ ■ ■ < oij-\ < aj — a < ctj+i < ■ ■ ■ < ati, 

where I < n — deg(f). 

While in certain cases (i.e., for input and output), we may require f(x) 
to be a minimal polynomial a, in general, we relax this condition. 

The Following is a list of possible representations of a: 

1. Order Representation: The algebraic number a is represented as 
a pair consisting of its polynomial, /, and its index, j, in the sequence 
enumerating the real roots of /: 

(a)o = (f,j). 

Clearly, this representation requires only 0(nlg ||/||i + logn) bits. 

2. Sign Representation: The algebraic number a is represented as 
a pair consisting of its polynomial, /, and a sign sequence, s, repre- 
senting the signs of the Fourier sequence of /' evaluated at the root 
a: 

(a), = (f,S = Sgn Q (FOURIER(/'))). 

The validity of this representation follows easily from the Little Thorn's 
theorem. The sign representation requires only 0{n lg +n) bits. 

3. Interval Representation: The algebraic number a is represented 
as a triple consisting of its polynomial, /, and the two end points of 
an isolating interval, (7, r) (I, r <E Q, I < r) containing only a: 

(a)i = (f,l,r). 

By definition, 

max(aj_i,-l - H/lloo) <l<a 3 =a<r< min(a,- + i, 1 + ||/||oo), 

using our bounds for the real root separation, we see that the interval 
representation requires only 0(nlg ||/||i + nlgn) bits. 

Additionally, we require that the representation be normalized in the 
sense that if a ^ then ^ (I, r), i.e., / and r have the same sign. We 
will provide a simple algorithm to normalize any arbitrary interval 
representation. 
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Example 8.5.8 Consider the representations of the following two alge- 
braic numbers —\pl + \/3 and \[2 + ^/3: 



-V2 + V3) 


= (- 4 - 


lOx 2 


+ 1, 


3), 


(V2 + V3) 


= <* 4 - 


lOx 2 


+ 1, 


4), 


-V2 + V3) s 


= (x 4 - 


lOx 2 


+ 1, 


(-1,-1, +1)), 


(V2 + V3) s 


= 


lOx 2 


+ 1, 


(+1,+1,+1)), 


-V2 + V3) l 


= <* 4 - 


lOx 2 


+ 1, 


1/11, 1/2), 


(V2 + V3)i 


= <- 4 - 


lOx 2 


+ 1, 


3, 7/2). □ 



Here, we shall concentrate only on the interval representation, as it appears 
to be the best representation for a wide class of models of computation. 
While some recent research indicates that the sign representation leads to 
certain efficient parallel algebraic algorithms, it has yet to find widespread 
usage. Moreover, many of the key algebraic ideas are easier to explain 
for interval representation than the others. Henceforth, unless explicitly 
stated, we shall assume that the real algebraic numbers are given in the 
interval representation and are written without a subscript: 

(a) = {f,l,r). 

The following interval arithmetic operations simplify our later exposi- 
tion: 

Let ii = (h, n) = {x : h < x < n} and J 2 = (h, r 2 ) = {x : l 2 < 
x < r 2 } be two real intervals; then 

h+h = (h+l2,n+r 2 ) 

= {x + y : h < x < n and l 2 < x < r 2 }, 

h-h = {h - r 2 , n - h) 

= {x - y : h < x < ri and l 2 < x < r 2 }, 

h-I 2 = (min(7iZ 2 , hr 2 , nl 2 , nr 2 ), max(/i/ 2 , hr 2 , nl 2 , nr 2 )) 
= {xy : h < x < ri and l 2 < x < r 2 }. 

We begin by describing algorithms for normalization, refinement and 
sign evaluation. 
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Normalization 

NORMALIZE(a) 

Input: A real algebraic number a = (f,l,r) G A. 

Output: A representation of a = (/,/', r') such that g (l',r'). 

p== 1/(1 + 11/11-); 

if Var ; (S) > Var_ p (S) then 

return a = (/, Z, — p) 
elsif Var_ p (S) > Var p (5) then 

return a = 

else 

return a = (/, p, r) 
end{if }; 
end {Normalize} □ 



The correctness of the theorem is a straightforward consequence of 
Sturm's theorem and bounds on the nonzero zeros of /. It is easily seen 
that the algorithm requires 0(n 2 ) arithmetic operations. 



Refinement 

Refine(oi) 

Input: A real algebraic number a — {f,l,r} G A. 

Output: A finer representation of a — (/,/', r') such that 
2(r' - < (r - I). 

Let S — STURM (/, /') be a Sturm sequence; 

m := (1 + r)/2; 
if Van (S) > Var m (5) then 
return (/, I, m) 

else 

return (/, m, r) 
end{if }; 
end{REFlNE} □ 



Again the correctness of the algorithm follows from the Sturm's theorem 
and its time complexity is 0(n 2 ). 
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Sign Evaluation 

SiGN(a,g) 

Input: A real algebraic number a = (/, I, r) £ A, and 
a univariate rational polynomial g(x) € Q[*~>]- 

Output: sgn(g(a)) = sign of g at a. 

Let S g = STURM (/, f'g), be a Sturm sequence; 

return Vaxi(S g ) — Var r (5 9 ); 
end{SlGN} □ 

The correctness of the algorithm follows from Sturm- Tarski theorem, 
since 



Var; (sturm (/, f'g)) - Var r (STURM(/, f'g)) 



g>0 



c f 



g<0 



+1, if g(a) > 

0, if g(a) = 

-1, if 5(a) <0 

and since / has only one root a in the interval (l,r). The algorithm has 
a time complexity of 0(n 2 ). This algorithm has several applications: for 
instance, one can compare an algebraic number a with a rational number 
p/q by evaluating the sign of the polynomial qx — p at a; one can compute 
the multiplicity of a root a of a polynomial / by computing the signs of 
the polynomials /', f 2 \ f 3 \ etc., at a. 



Conversion Among Representations 

Interval representation to order representation: 

IntervalToOrder(q) 

Input: A real algebraic number (a), = (f,l,r) G A. 
Output: Its order representation (a} = (f,j). 

Let S — STURM (/, /') be a Sturm sequence; 

return (/, Var^^n^^S) - Var r (S)>; 

end{lNTERVALToORDER} □ 
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Interval representation to sign representation: 

iNTERVALToSlGN(a) 

Input: A real algebraic number (a); = (f,l,r) G A. 
Output: Its sign representation {a) s = (f,s). 
Let </',/< 2 >,...,/<">> = fouiuer(/'); 

s := ^ SlGN(a,/'), 
SlGN(a,/ (2) ), 

SlGN(a,/ (n) ) 
return (/,s); 

end{lNTERVALToORDER} □ 

Again, the correctness of these algorithms follow from Sturm- Tarski 
theorem. The algorithms IntervalToOrder has a time complexity of 
0{n 2 ), and IntervalToSign has a complexity of 0(n 3 ). 

Arithmetic Operations 

Additive inverse: 

AdditiveInverse(q) 

Input: A real algebraic number (a) = (f,l,r) G A. 

Output: — a in its interval representation. 

return (f(—x), —r,—l); 
end{ AdditiveInverse} □ 

The correctness follows from the fact that if a is a root of f(x), then 
—a is a root of f(—x). Clearly, the algorithm has a linear time complexity. 

Multiplicative inverse: 

MultiplicativeInverse(ck) 

Input: A nonzero real algebraic number (a) = (/, I, r) G A. 
Output: 1/a in its interval representation. 

return (.— /(I), 1,1); 

end{MULTIPLICATIVElNVERSE} Q 
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The correctness follows from the fact that if a is a root of f(x), then 
1/a is a root of x d ° s ^ f(l/x). Again, the algorithm has a linear time 
complexity. 

Addition: 



Addition(cvi, a 2 ) 

Input: Two real algebraic numbers 

(ai) = and (a 2 ) = (/2,^2,r 2 ) G A. 

Output: ai + 02 = «3 = (f3,h,r3) in its interval representation. 

/ 3 := Resultant,, (/1 (a; - y), h{y))\ 

S:= Sturm(/ 3 ,/3); 

^3 := '1 + '2; 
r-3 := n + r 2 ; 

while Var i3 (5) - Var r3 (5) > 1 loop 

(fi,h,ri) := REFlNE((/i,Zi,n»; 
(f2,h,r 2 ) := Refine((/ 2 , Z 2 ,r 2 }); 

Z3 := '1 + Z 2 ; 
r3 := ri + r 2 ; 
end{loop }; 

return (f s ,l s ,r s ); 
end{ADDlTlON} D 

The correctness of the algorithm follows from the main properties of 
the resultant and the repeated refinement process yielding an isolating in- 
terval. The resulting polynomial fz in this algorithm has the following size 
complexities: 

deg(/ 3 ) < m ri2, 
ll/slli < 2°("^) HAUr H/2II" 1 , 

where deg(/i) = ni and deg(/ 2 ) = n 2 . It can now be shown that the 
complexity of the algorithm is 0(n\n\ lg ||/i||i + n\n\ lg H/2II1). 
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Multiplication: 

Multiplication^, 02) 

Input: Two real algebraic numbers 

(ai) = (/i,/i,n) and (a 2 ) = (/ 2 ,Z2,r 2 ) G A. 

Output: ai • a.i = «3 = (/ 3 ,Z 3 ,r 3 ) in its interval representation. 

/ 3 := Resultant, ^y**(/i) / x Q , / 2 („) j ; 

5:= STURM (/ 3 , /a); 

Z 3 := min(Zi/2,/ir2,ri/2,r-ir2); 
r 3 := max(/i/ 2 ,/ir2,riZ 2 ,rir2); 

while Var; 3 (5) - Var r3 (S) > 1 loop 

(fi,h,ri) := REFiNE((/i,/i,n)); 
(f2,l2,r 2 ) := REFlNE((/ 2 ,Z 2 ,r 2 }); 

/ 3 := min(/i; 2 ,/ir 2 ,riZ 2 ,rir 2 ); 
r 3 := m&x(lil2,hr 2 ,ril2,r 1 r 2 ); 
end{loop }; 

return (/ 3 , Z 3 , r 3 ); 
end {Multiplication} □ 

The correctness of the multiplication algorithm can be proven as before. 
The size complexities of the polynomial fa are: 

deg(/ 3 ) < ni n 2 , 
ll/slli < m n 2 H/xlir ||/ 2 ||r, 



where deg(/i) = m and deg(/2) = ri2- Thus, the complexity of the algo- 
rithm is 0{n\r4 lg IIMIj + njnl lg ||/ 2 ||i). 

8.6 Real Geometry 

Definition 8.6.1 (Semialgebraic Sets) A subset S C M K is a said to 
be a semialgebraic set if it can be determined by a set-theoretic expression 
of the following form: 

5 = U 11 eM>< : s g n (^,3(^,---,^)) = ~3,3}, 

i=l j=l 
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where fij's are multivariate polynomials in . . ., x n ]' 

/, J (ii,...,a:„)eM[^,...,^ [< ], D = JK,...,>, J=H z ,...,<n, 

and Sij's are corresponding set of signs in {—1, 0, +1}. D 

Such sets arise naturally in solid modeling (as constructive solid geomet- 
ric models), in robotics (as kinematic constraint relations on the possible 
configurations of a rigid mechanical system) and in computational algebraic 
geometry (as classical loci describing convolutes, evolutes and envelopes). 

Note that the semialgcbraic sets correspond to the minimal class of 
subsets of K K of the following forms: 

1. Subsets defined by an algebraic inequality: 



where f(xi, . . ., x n ) e . . ., 

2. Subsets closed under following set-theoretic operations, complemen- 
tation, union and intersection. That is, if Si and S2 are two semial- 
gebraic sets, then so are 



After certain simplifications, we can also formulate a semialgcbraic set 
in the following equivalent manner: it is a finite union of sets of the form 
shown below: 



where the gi(xi, . . ., x n ) e M.[r\r, ■ ■ ., x„] (i = 1, . . ., s). 

It is also not hard to see that every propositional algebraic sentence 
composed of algebraic inequalities and Boolean connectives defines a semi- 
algebraic set. In such sentences, 

1. A constant is a real number; 

2. An algebraic variable assumes a real number as its value; there are 
finitely many such algebraic variables: Xi, X2, ■ ■ ., x n . 




S = 



9i(€i, ■ ••,&») = ■■■ = g r (€i, ■ ■ ■ ,£n) = 0, 

ffr+l^l, • • -,€n) > 0, • • • , g s (£i, ...,$„)> |, 
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3. An algebraic expression is a constant, or a variable, or an expression 
combining two algebraic expressions by an arithmetic operator: "+" 
(addition), "— " (subtraction), "•" (multiplication) and "/" (division). 

4. An atomic Boolean predicate is an expression comparing two arith- 
metic expressions by a binary relational operator: "=" (equation), 
V" (inequation), ">" (strictly greater), "<" (strictly less), ">" 
(greater than or equal to) and "<" (less than or equal to). 

5. A propositional sentence is an atomic Boolean predicate, a negation of 
a propositional sentence given by the unary Boolean connective: 
(negation), or a sentence combining two propositional sentences by 
a binary Boolean connective: "=>" (implication), "A" (conjunction) 
and "V" (disjunction). 

Thus another definition of scmialgebraic sets can be given in terms 
of a propositional algebraic sentence tp(xi, . . ., x n ) involving n algebraic 
variables: This is a subset S C M K determined by ip as follows: 

S={«i UeR K : *K&,- ■■,£«) =True}. 

It turns out that if we enlarge our propositional sentences to include 
first-order universal and existential quantifiers (the so-called Tarski sen- 
tences), then the set determined by such sentences (Tarski sets) are also 
scmialgebraic sets. That is, quantifiers do not add any additional power! 
This result follows from Tarski's famous quantifier elimination theorem, 
which states that any set S determined by a quantified sentence is also 
determined by a quantifier- free sentence ip' , or 

Proposition 8.6.1 Every Tarski set has a quantifier-free defining sen- 
tence. □ 

Later on in this section, we shall come back to a more detailed discussion 
of Tarski sentences and some results on an effective procedure to decide if 
a given Tarski sentence is true — geometric theorem proving problem. 

Example 8.6.2 Some examples of such propositional algebraic sentences 
include the following: 
1. 

<f> 1 (x 1 ,x 2 ,...,x n ) = x\+xl + Vx 2 n <Q. 

The semialgebraic set defined by is the empty set. 

2. 

(x 4 - 10x 2 + 1 = 0) A (2x < 7) A [x > 3) 

defines the only root of the polynomial x 4 — 10x 2 + 1 in the isolating 
interval [3, 7/2] — namely, the real algebraic number \A2 + V3- 
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3. 

(x 2 + bx + c = 0) A (y 2 + by + c = 0) A (x^y), 

which has a real solution (x, y) if and only if b 2 > 4c, i.e., when the 
quadratic polynomial x 2 + bx + c has two distinct real roots. □ 

The following properties of semialgebraic sets are noteworthy: 

• A semialgebraic set S is semialgebraically connected, if it is not the 
union of two disjoint nonempty semialgebraic sets. 

A semialgebraic set S is semialgebraically path connected if for every 
pair of points p, q € S there is a semialgebraic path connecting p and 
q (one-dimensional semialgebraic set containing p and q) that lies in 
S. 

• A semialgebraic set is semialgebraically connected if and only if it is 
semialgebraically path connected. Working over the real numbers, it 
can be seen that a semialgebraic set is semialgebraically path con- 
nected if and only if it is path connected. Thus, we may say a semial- 
gebraic set is connected when we mean any of the preceding notions 
of connectedness. 

• A connected component (semialgebraically connected component) of a 
semialgebraic set S is a maximal (semialgebraically) connected subset 
of S. 

• Every semialgebraic set has a finite number of connected components. 

• If S is a semialgebraic set, then its interior, int(S'), closure, S, and 
boundary d(S) = S \ int(S) are all semialgebraic. 

• For any semialgebraic subset S C M K , a semialgebraic decomposition 
of S is a finite collection JC of disjoint connected semialgebraic subsets 
of S whose union is S. Every semialgebraic set admits a semialgebraic 
decomposition. 

Let T = {fij ■ i = 1, • • ., m, j = 1, . . ., l{\ C . . ., x n ] be a set 

of real multivariate polynomials in n variables. Any point p = 
. . ., £ n ) € R K has a sign assignment with respect to T as follows: 

sgn^-(p) = (sgntfijfa, . . .,£„)) :i = l,...,m, j = 1, . . . ,k^. 

Using sign assignments, we can define the following equivalence rela- 
tion: Given two points p, q € R* , we say 



p q, if and only if sgn^(p) = sgn^-(g). 
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Now consider the partition of M K defined by the equivalence relation 
each equivalence class is a semialgebraic set comprising finitely 
many connected semialgebraic components. Each such equivalence 
class is called a sign class of T . 

Clearly, the collection of semialgebraic components of all the sign 
classes, JC, provides a semialgebraic decomposition of R K . Further- 
more, if S C R K is a semialgebraic set defined by some subset of T, 
then it is easily seen that 



defines a semialgebraic decomposition of the semialgebraic set S. 

• A semialgebraic cell-complex {cellular decomposition) for T is a semi- 
algebraic decomposition of M K into finitely many disjoint semialge- 
braic subsets, {Ci}, called cells such that we have the following: 

1. Each cell Ci is homeomorphic to R J P', < S(i) < n. 5(i) is 
called the dimension of the cell d, and d is called a S(i)-cell. 

2. Closure of each cell Ci, Ci, is a union of some cells Cj's: 



3. Each Ci is contained in some semialgebraic sign class of T — that 
is, the sign of each fij e T is invariant in each d. 

Subsequently, we shall study a particularly "nice" semialgebraic cell- 
complex that is obtained by Collin's cylindrical algebraic decomposi- 
tion or CAD. 

8.6.1 Real Algebraic Sets 

A special class of semialgebraic sets are the real algebraic sets determined 
by a conjunction of algebraic equalities. 

Definition 8.6.3 (Real Algebraic Sets) A subset Z C M K is a said to 
be a real algebraic set, if it can be determined by a system of algebraic 
equations as follows: 



where /j's are multivariate polynomials in R[Vv, • • ■, x n ]. □ 




d = \JC j . 



3 
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Figure 8.1: Projection of a real algebraic set Z. 



Note that a real algebraic set Z could have been denned by a single 
algebraic equation as follows: 

z = {(&,...,£„) ei K : ^(6k,---,€k) + -" + ^(6k,...,^)=^}, 

as we are working over the field of reals, M. 

While real algebraic sets are quite interesting for the same reasons as 
complex algebraic varieties, they lack certain "nice" geometric properties 
and hence, are somewhat unwieldy. For instance, real algebraic sets are not 
closed under projection onto a subspace. 

Consider the following simple real algebraic set defining a parabola: 

Z= {{x,y) eR" : ^ = rv K }. 

If ir x is a projection map defined as follows: 

7T : K 2 -> M 

: {x,y)^>x, 

then 

tt(Z) : ^ >F}. 



See Figure 8.1. 

Clearly, 7r(Z) is not algebraic, since only algebraic sets in R are finite 
or entire R. However, it is semialgebraic. Additionally, we shall see that 
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semialgebraic sets are closed under projection as well as various other set- 
theoretic operations. In fact, semialgebraic sets are the smallest class of 
subsets of 1 K containing real algebraic sets and closed under projection. 
In the next two subsection, we shall develop some machinery that among 
other things shows that semialgebraic sets are closed under projection. 

8.6.2 Delineability 

Let fi(xi, . . ., x n -i, x n ) £ R[-^f, . . ., x n -i, x n ] be a polynomial in n 
variables: 

fi {xi , . . . , X n —l, Xn) = fi (x\ , . . . , X n —\) X n l 

H 1" ff(xi, ■ ■ -,x n -i), 

where //' s are in . . ., x n -{\. Let p' = . . ., £ n _i) e R K ^. Then 

we write 

fi, P '(Xn) = f? i ( P ')x d r ;+ ■■■ + /?&), 

for the univariate polynomial obtained by substituting p' for the first (n— 1) 
variables. 

Definition 8.6.4 (Delineable Sets) Let 




f s {x u . . . ,x n ) | C R[a;i,...,a; n ] 

be a set of s n-variate real polynomials. Let C C M K_F be a nonempty set 
homeomorphic tolR' 5 (0 < 5 < n — 1). 

We say T is delineable on C (or, C is T- delineable) , if it satisfies the 
following invariant properties: 

1. For every 1 < i < s, the total number of complex roots of fay (count- 
ing multiplicity) remains invariant as p' varies over C. 

2. For every 1 < i < s, the number of distinct complex roots of fay (not 
counting multiplicity) remains invariant as p' varies over C. 

3. For every 1 < i < j < s, the total number of common complex roots 
°f fa,p' and fjy (counting multiplicity) remains invariant as p' varies 
over C. □ 
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Theorem 8.6.2 Let T C . . ., x n ] be a set of polynomials as in 

the preceding definition, and let C Q M^~ r be a connected maximal T- 
delineable set. Then C is semialgebraic. 
PROOF. 

We show that all the three invariant properties of the definition for delin- 
eability have semialgebraic characterizations, 
(f ) The first condition states that 

(V ij (Vp' € Cj \Z(fi, P ')\ = invariant 

where Z(f) denotes the complex roots of /. This condition is simply equiv- 
alent to saying that "deg(/ ijP /) is invariant (say, fcj)." A straightforward 
semialgebraic characterization is as follows: 



•)( 



y Ki< s) [30 <h <di 



(Vk>h) [f*(x 1 ,...,x n - 1 ) = 0] A f? i (x 1 ,...,x n - 1 )^0 

holds for all p' G C . 

(2) The second condition, in view of the first condition, can be restated 
as follows: 

(vi) (Vj/eC) [\CZ(f i!P ,, D Xn (f iy )) | = invariant], 

where D Xn denotes the formal derivative operator with respect to the vari- 
able x n and CZ(f,g) denotes the common complex roots of / and g. Using 
principal subresultant coefficients, we can provide the following semialge- 
braic characterization: 

(Vl<i<s) (3 < U < di - l) 

(V I < h) [PSCf» (fi( Xl , . . . , x n ), D x JMx u x n ))) = 0] 

A PSCf 4 » ...,x n ),D Xn {f i (x 1 ,...,x n )))?6 

holds for all p' e C; here PSC^" denotes the I th principal subresultant 
coefficient with respect to x n . 

(3) Finally, the last condition can be restated as follows: 



Vz^jJ (Vj/eCJ [\CZ(f iiP ,,f j>p ,)\ = invariant 

Using principal subresultant coefficients, we can provide the following semi- 
algebraic characterization: 

(V 1 < i < j < (3 0< mij < min(dj, djfj 

(V m < m^) \PSG%(fi(xi, . . -,x n ), fj(x lt . . .,x n )) = 0] 
A PSC^T (fi(xu x n ),fj{xi, x n )) ^ o| , 
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holds for all p' G C; here PSC^ denotes the m th principal subresultant 
cocfBcient with respect to x n . □ 

In summary, given a set of polynomials, T G R[^f, • • x n ], as shown 
below, we can compute another set of (n — l)-variate polynomials, G 
K[^^, . . ., x n -i], which precisely characterizes the connected maximal 
J^-delineable subsets of Let 

F = f2, ■ ■ ■ , /s|; 

then 

= {/*(xi,...,a:n-i : 1 < » < «, < fc < d,} 

U {pSCf " (/iCsi, . . . , *„), D Xn (fi( Xl , . . . , *„)) : 

1 < i < s, < I < dk - l} 
U {PSC^»(/ i (a;i,...,a: n ),/ j (a;i,...,a: n )) : 

l!i*<i!i s ! 0<m< min(rfi,rfj)|. 

Now, we come to the next important property that delineability pro- 
vides. Clearly, by definition, the total number of distinct complex roots of 
the set of polynomials T is invariant over the connected set CCl K_f . But 
it is also true that the total number of distinct real roots of T is invariant 
over the set C. 

Consider an arbitrary polynomial fa G T; since it has real coefficients, 
its complex roots must occur in conjugate pairs. Thus as /j ;P ' varies to 
fi t qi such that some pair of complex conjugate roots (which are necessarily 
distinct) coalesce into a real root (of multiplicity two), somewhere along 
a path from p' to q' the total number of distinct roots of / must have 
dropped. Thus a transition from a nonreal root to a real root is impossible 
over C . Similar arguments also show that a transition from a real root to 
a nonreal is impossible as it would imply a splitting of a real root into a 
pair of distinct complex conjugate roots. 

More formally, we argue as follows: 

Lemma 8.6.3 Let T C K.[^^, . . x n ] be a set of polynomials as before, 
and let C C R K ~ F be a connected T -delineable set. Then the total number 
of distinct real roots of T is locally invariant over the set C. 
PROOF. 

Consider a polynomial /, G T . Let p' and q' be two points in C such that 
\\p' — q'\\ < e and assume that every root of fi tP > differs from some root of 
fi t q' by no more that 




Separation(/j jP / ) , 
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where Separation denotes the complex root separation of fi tP > . 

Now, let Zj be a complex root of fiy such that a disc of radius 5 centered 
around Zj in the complex plane contains a real root yj of fi A '. But then it 
can also be shown that yj is also in a disc of radius S about ~zj, the complex 
conjugate of Zj. But then Zj and zj would be closer to each other than 
Separation(/i. p /), contradicting our choice of 6. Thus the total number of 
distinct complex roots as well as the total number of distinct real roots of 
fi remains invariant in any small neighborhood of a point p' e C. □ 

Lemma 8.6.4 Let T C R[-/^ 7 . . x n ] and C C M K_F be a connected T- 
delineable set, as in the previous lemma. Then the total number of distinct 
real roots of J 7 is invariant over the set C. 

PROOF. 

Let p' and q' be two arbitrary points in C connected by a path 7 : [0, 1] — > C 
such that 7(0) = p' and 7(1) = q' . Since 7 can be chosen to be contin- 
uous (even semialgebraic) the image of the compact set [0, 1] under 7, 
r = 7([0, 1]) is also compact. At every point r' £ T there is a small neigh- 
borhood N(r') over which the total number of distinct real roots of T 
remains invariant. Now, since the path T has a finite cover of such neigh- 
borhoods N(r[), N(r' 2 ), ■ ■ ., N(r' k ) over each of which the total number 
of distinct real roots remain invariant, this number also remains invariant 
over the entire path T. Hence, as C is path connected, the lemma follows 
immediately. □ 

As an immediate corollary of the preceding lemmas we have the follow- 
ing: 

Corollary 8.6.5 Let T C . . x„] be a set of polynomials, deline- 

able on a connected set C C M K_i|z . 

1. The complex roots of T vary continuously over C. 

2. The real roots of T vary continuously over C, while maintaining their 
order; i.e., the j th smallest real root of T varies continuously over C. 
□ 

Using this corollary, we can describe how the real roots are structured 
above C. Consider the cylinder over C obtained by taking the direct prod- 
uct of C with the two-point compactification of the reals, lU {±00}. Note 
that the two-point compactification makes it possible to deal with vertical 
asymptotes of the real hypersurfaces defined by T . 

The cylinder C x (R U {±00}) can be partitioned as follows: 

Definition 8.6.5 (Sections and Sectors) Suppose T C . . ., x n ] 

is delineable on a connected set C C . Assume that T has finitely 

many distinct real roots over C, given by m continuous functions 



nip'), r 2 (p'), r m (p'), 
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Figure 8.2: Sectors and sections over C. 



where rj denotes the j th smallest root of T . (See Figure 8.2). Then we 
have the following: 

1. The j th T-section over C is 

{(p',x n ) : p' eC, x n = rj(p')Y 

2. The j th (0 < j < n) intermediate T-sector over C is 

{(p',x n ) : f/€C, rj(p') < x n < r j+1 (p')Y 
The lower semiinfinite T-sector over C is 

{(P ,x n ) : p' e C, x n < ri(p')}- 
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Figure 8.3: Sectors and sections for the parabola example. 



The upper semiinfinite T-sector over C is 

|V,a;„) : p' e C, x„ > r m (p')}- □ 

Example 8.6.6 Consider the real polynomial f(x, y) — y 2 — x e M.[r\, y] 
defining a parabola in the plane (see Figure 8.3). Note that 

f(x,y) = y 2 (Fj+y 1 (0) + y° (-x), 

and thus 

f 2 (x) = l, f\x)=0, and f(x) = -x. 
Since the subresultant chain of / = y 2 — x and f — D y (f) — 2y are given 

by 

SubRes 2 (/,/') = y 2 -x 

SubRes! (/,/') = 2y 

SubRcso (/,/') = -Ax, 

the principal subresultant coefficients are 

PSC 2 (a;) = 1, PSCi = 2, and PSC = -Ax. 

Thus 

$({/}) = {l,x}, 
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and the maximal connected /-delineable sets are A = [— oo, 0), B = [0,0] 
and C = (0, +oo]. There is only one infinite sector over A, as for every 
x G A, y 2 — x has no real zero. There are two semiinfinite sectors and 
one section over B, as y 2 — x has one zero (of multiplicity two) at y = 0. 
Finally, there are three sectors and two sections over C, as for every x G C, 
y 2 — x has two distinct real zeros (each of multiplicity one) . 

Note that as we traverse along the ai-axis from — oo to +oo, we see that 
y 2 — x has two distinct complex zeros for all x < 0, which coalesce into one 
real zero at x = and then split into two distinct real zeros for x > 0. □ 

Observe that the JF-sections and sectors arc uniquely defined by the 
distinct real root functions of T: 

ri(p'), r 2 (p'), r m (p'), 

where it is implicitly assumed that not all fi iP i = (ft G T). It is sometimes 
easier to use a single multivariate polynomial g = H{J r ){x\, . . ., x n ) with 

g'p(xn) = g(p',x n ) 

vanishing precisely at the distinct roots r\{p'), r 2 {p'), ■ . . , r m (p'). 
U(J r )(xi,...,x n )= J | fi(xi,...,x n ), 

fi€F,fi, p ,20 

where p' G C, a connected ^"-delineable set. By convention, we shall have 
n(^ r ) = 1, if all fi tP i = (fi G T). Also, when n(J r ) = constant, the 
cylinder over C will have exactly one infinite T-sector. Cx{lU {±oo}. 

8.6.3 Tarski-Seidenberg Theorem 

As an immediate consequence of the preceding discussions, we are now 
ready to show that semialgebraic sets are closed under projection. A more 
general result in this direction is the famous Tarski-Seidenberg theorem. 

Definition 8.6.7 (Semialgebraic Map) A map ip : S — > T, from a 
semialgebraic set S C M > to a semialgebraic set T C 1 K is said to be 
a semialgebraic map, if its graph 

{(s,V(s)) GM >+K :~G§} 

is a semialgebraic set in R >+K . □ 

Theorem 8.6.6 (Tarski-Seidenberg Theorem) Let S be a semialge- 
braic set in M > and 

ip : — > M K 
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be a semialgebraic map; then ip(S) is semialgebraic in M K . 
PROOF. 

The proof is by induction on m. We start by considering the base case 
m = 1. Then the graph of ip, say V, is a semialgebraic set in R ><+ * : and 
the image of 5*, tp(S), is a subset of M K . After suitable renaming of the 
coordinates, we can so arrange that V is defined by a set of polynomials 
T C R[^if , . . ., x„+i] and ?/>(S) = 7r(V) is the projection of V onto the 
first n coordinates: 

7T : M™ +1 -» R* 

: • • • ,€n,£n+l) !-» (6, • • • ,£n)- 

Corresponding to the set J 7 , we can define the set of polynomials as 
earlier. Now, note that if C is a cell of a sign-invariant cell decomposition, 
K, of K K , defined by then C is a maximal connected .F-delineable 

set. 

Next, we claim that for every C £ /C, 

Cn7r(y)^0 CCtt(1/). 

To see this, note that since C n 7r(V) 7^ 0, there is a point p e V, such 
that p' = n(p) e C. Thus p belongs to some ^"-section or sector defined by 
some real functions rj(j/) and rj+ifj/). Now consider an arbitrary point 
q' G C; since C is path connected, there is a path 7' : [0, 1] — > C such 
that 7'(0) = p' and 7 ; (1) = </■ This path can be lifted to a path in V, by 
defining 7 : [0, 1] — ► V as follows: 

(f] n +1 {j{t))[p - n{ P ')] - n{j{t))[p r l+1 {p>)] 
7U ~ r i+1 {p')-n{p') 

where t e [0, 1]. 

Clearly, the path 7([0, 1]) e V; 7r( 7 (t)) = j'(t), and g' e tt(V), as 
required. Hence ip(S) = ir(V) can be expressed as a union of finitely many 
semialgebraic cells of the decomposition JC, since 

tt(V) C (J \C : Cnn(V)^<b} C tt(V). 

Hence, V'(S') is semialgebraic in R K . 

For m > 1, the proof proceeds by induction, as any projection from 
n : M > x R K — > M K can be expressed as a composition of the following two 
projection maps: n' : R>~ F x R K+if -» M K+if and tt' : R K+F -> M K . □ 

Corollary 8.6.7 Let S be a semialgebraic set in R> and 

ip : R > — ► R K 
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be a polynomial map; then ip(S) is semialgebraic in M K . 

PROOF. 

Let tp be given by the following sequence of polynomials 

g k (x 1 , . . .,x rn ) e M[^ F , . . . ,^>], "I =JK, . .. , x. 
Then the graph of the map is denned by 

(5xI K )nT, 

where 

T = {<&,...,&,, Cu-..,(n) eR >+K : 

9k(€i, ■ --,6n) ~ Cfe = 0, for all k = 1, . . . ,n|. 

Thus V' is a semialgebraic map and the rest follows from the Tarski-Seidenberg 
theorem. D 

8.6.4 Representation and Decomposition of 
Semialgebraic Sets 

Using the ideas developed in this chapter (i.e., Sturm's theory and real al- 
gebraic numbers), we can already see how semialgebraic sets in RU {±00} 
can be represented and manipulated easily. In this one-dimensional case, 
the semialgebraic sets can be represented as a union of finitely many inter- 
vals whose endpoints are real algebraic numbers. For instance, given a set 
of univariate defining polynomials: 

^={/ y WeQH : n = JH,...,>, 3=JK,...,<a}, 

we may enumerate all the real roots of the /»,/s (i.e., the real roots of the 
single polynomial F = FJ i ■ fij) as 

-00 < & < 6 < ■■■ < 6-1 < 6 < 6+1 < ••• < 6 < +00, 

and consider the following finite set K of elementary intervals defined by 
these roots: 

[-00,6), [a, 6], (6,6), .... 

(6-1,6), [6,6], (6,6+1), [6,6], (6, +00]. 

Note that, these intervals are defined by real algebraic numbers with defin- 
ing polynomial 

n^= n 
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Now, any semialgebraic set in S C R U {±00} denned by T: 



m li 



5 =un{^ KLj { ±M } : s s n (^j(o) 



i=i j=i 



where Sjj £ {— 1, 0, +1}, can be seen to be the union of a subset of elemen- 
tary intervals in JC. Furthermore, this subset can be identified as follows: if, 
for every interval C £ fC, we have a sample point a>c € C, then C belongs 
to S if and only if 



For each interval C G fC, we can compute a sample point (an algebraic 
number) ap as follows: 



Note that the computation of the sample points in the intervals, their 
representations, and the evaluation of other polynomials at these points 
can all be performed by the Sturm theory developed earlier. 

A generalization of the above representation to higher dimensions can 
be provided by using the machinery developed for dclincability. In order 
to represent a semialgebraic set SCl K , we may assume recursively that 
we can represent its projection ir(S) C R K ~ F (also a semialgebraic set), 
and then represent S as a union of the sectors and sections in the cylinders 
above each cell of a semialgebraic decomposition of tt(S). This also leads 
to a semialgebraic decomposition of S. 

We can further assign an algebraic sample point in each cell of the 
decomposition of S recursively as follows: Assume that the (algebraic) 
sample points for each cell of n(S) have already been computed recursively. 
Note that a vertical line passing through a sample point of tt(S) intersects 
the sections above the corresponding cell at algebraic points. From these 
algebraic points, we can derive the algebraic sample points for the cells of 
S, in a manner similar to the one-dimensional case. 

If T is a defining set for S C R K , then for no additional cost, we 
may in fact compute a sign invariant semialgebraic decomposition of R* 
for all the sign classes of J 7 , using the procedure described above. Such 
a decomposition leads to a semialgebraic cell-complex, called cylindrical 
algebraic decomposition (CAD). This notion will be made more precise 
below. Note that since we have an algebraic sample point for each cell, 
we can compute the sign assignment with respect to T of each cell of 
the decomposition and hence determine exactly those cells whose union 
constitutes S. 




[6-1, 

6, 

& + 6+0/2, 
k 6 + 1, 



if c 
if c 
if c 
if c 
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8.6.5 Cylindrical Algebraic Decomposition 

Definition 8.6.8 [Cylindrical Algebraic Decomposition (CAD)] A 

cylindrical algebraic decomposition (CAD) of is denned inductively as 
follows: 

• Base Case: n = 1. 

A partition of into a finite set of algebraic numbers, and into the 
finite and infinite open intervals bounded by these numbers. 

• Inductive Case: n > 1. 

Assume inductively that we have a CAD JC' of M K ~ F . Define a CAD 
JC of R K via an auxiliary polynomial 

gc(x,x n ) = gc(xi, ■ ■ ■ ,x n _i,x n ) e Q[r\^, ■ ■ ■ 

one per each C G JC' . The cells of JC are of two kinds: 

1. For each C G JC', 

C x (lU{±oo}) = cylindrical over C. 

2. For each cell C G JC' , the polynomial gc(p',x n ) has m distinct 
real roots for each p' G C: 

n(p'),r 2 (p'), . . .,r m (p'), 

each n being a continuous function of p'. The following sectors 
and sections are cylindrical over C": 

Cq = {(p',x n ) : x n G [-oo,n(f/))}, 

Ci = { (*/,*„}: a„e [ri(i/),n (*/)]}, 

CT = {<?/,*„) : ar„e(n(p'),r2(p'))} ) 

C" 2 = {(p',x n ) : x n e[r 2 (p'),r 2 (p')}}, 

C m = {{p',x n ) : ie„ e [r TO (p'),r m (f/)]}, 

= |(p',a; n ) : i„ e (r m (p'),+oo]J. 

See Figure 8.4 for an example of a cylindrical algebraic decomposition of 
R*. □ 

Let 

F= {hj(xi,---,x n ) g Q[^,...,^ K ] : 3 = H Z ,...,>, 3=^,...,<n}- 

In order to compute a cylindrical algebraic decomposition of M K , which is 
^-sign-invariant, we follow the following three steps: 



350 Real Algebra Chapter 8 




a bed e 



Figure 8.4: Cylindrical algebraic decomposition. 

1. Project: Compute the in — l)-variate polynomials <&{T). Note that 
if \T\ is the number of polynomials in T and d is the maximum degree 
of any polynomial in T 1 then 

|*(.F)| = 0(d \F\ 2 ) and deg($(.F)) = 0(d 2 ). 

2. Recur: Apply the algorithm recursively to compute a CAD of R K ~ F 
which is $(J r )-sign- invariant. 

3. Lift: Lift the $(JT)-sign- invariant CAD of M K ~ F up to a J^-sign- 
invariant CAD of using the auxiliary polynomial II (.F) of degree 
no larger than d 

It is easy to see how to modify the above procedure in order that we 
also have a sample point (with algebraic number coordinates) for each cell 
of the final cell decomposition. The complete algorithm is as follows: 

CAD(JF) 

Input: f CQ[^,... 
Output: A ^-sign-invariant CAD of R K . 
if n = 1 then 

Decompose R U {±00} by the set of real roots of the polynomials 
of JF; Compute the sample points to be these real roots and their 
midpoints; 
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elsif n > 1 then 

Construct C Q[^>^, . . ., x n -i\, 

K! := CAD($(T)); 

Comment: K! is a $(:F)-sign- invariant CAD of R K ~*\ 

for each C' G K,' loop 

Construct II (JF), the product of those polynomials of T 
that do not vanish at some sample point a c > G C; De- 
compose RU {±00} by the roots of II(JF) into sections and 
sectors; 

Comment: The decomposition leads to a decomposition 
K, c > of C x (Ru{±oo}); 

The sample points above C'\ 



(a c >, ri(Q C /) - 1), 
(a c >, ri{a c >)}, 
(a c >, (n(a c -),r 2 (acr'))/2>, 
(a c >, r 2 (a c >)), 

{ac, r m (a C ')), 
(a c >, r m (a c ,) + 1), 



where n's are the real root functions for II (T); 

Each cell of the K-c' has a propositional defining sentence 
involving sign sequences for and JT; 

end{loop }; 



end{if }; 

return K,; 
end{CAD} □ 

Complexity 

If we assume that the dimension n is a fixed constant, then the algorithm 
CAD is polynomial in \T\ and deg(jF). However, the algorithm can be 
easily seen to be double exponential in n as the number of polynomials 
produced at the lowest dimension is 



each of degree no larger than d 2 " . Also, the number of cells produced 
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by the algorithm is given by the double exponential function 

(V| dcg(^)) 

while it is known that the total number of ^"-sign-invariant connected com- 
ponents are bounded by the following single- exponential function: 

deg^ " 

In summary, we have the following: 

Theorem 8.6.8 (Collin's Theorem) Given a finite set of multivariate 
polynomials 

T C Q[^f, • • • ,^k], 
we can effectively construct the followings: 

• An J 7 -sign-invariant cylindrical algebraic decomposition of JC of ]R K 
into semialgebraic connected cells. Each cell C € K, is homeomorphic 
to R s , for some < 5 < n. 

• A sample algebraic point pc in each cell C € K. and defining polyno- 
mials for each sample point pc ■ 

• Quantifier-free defining sentences for each cell C € K. D 

Furthermore, the cylindrical algebraic decomposition produced by the 
CAD algorithm is a cell complex, if the set of defining polynomials 

T C Q[^ F , . . . ,^><], 

is well-based in R K in the sense that the following nondegeneracy conditions 
hold: 

1. For all p' e M K - F , 

(v/ief) [fiip',x n )^o . 

2. <$>(F) is well-based in R"~ r . That is, For all p" el K ^, 

(v 5j -e$(.F)) [fc-(p",a; n _i)^0], 

and so on. 

The resulting CAD is said to be well-based. Also note that, given an 
T, there is always a linear change of coordinates that results in a well- 
based system of polynomials. As a matter of fact, any random change of 
coordinates will result in a well-based system almost surely. 



Section 8.6 



Real Geometry 



353 



Theorem 8.6.9 If the cellular decomposition JC produced by the CAD al- 
gorithm is well-based, then JC is a semialgebraic cell complex. 
PROOF. 

As a result of Collin's theorem, we only need to show that 

Closure of each cell Ci <G JC, Ci is a union of some cells Cj 's: 

d = \jc j . 

3 

The proof proceeds by induction on the dimension, n. When n = 1, it 
is easy to see that the decomposition is a cell complex as the decomposition 
consists of zero-dimensional closed cells (points) or one-dimensional open 
cells (open intervals) whose limit points (endpoints of the interval) are 
included in the decomposition. 

Let Ci G JC be a cell in R K , which is cylindrical over some cell C' k £ JC' , 
a CAD of 1R K_F . By the inductive hypothesis, we may assume that JC' is 
a cell complex and 

~C7 k = C' k u^u^u...uC[„ 

where C^.'s arc in JC' . 
We show that 

1. If Ci is a section, then d consists of 

(a) d itself. 

(b) Limit points of Cj. These are comprised of sections cylindrical 
over cells in dC' k . 

2. If d is a sector, then d consists of: 

(a) Ci itself. 

(b) Limit points of Ci. These are comprised of upper and lower 
bounding sections for d, cylindrical over C' Kl and sectors and 
sections cylindrical over cells in dC' k . 

The key idea is to show that, since sections are given by some continuous 
real root function rj(p'), the closure of a particular section d is simply the 
image of a real root function over some cell C' k C dC' k which extends 
r.i(p'). 

The proof is by contradiction: consider a sequence of points p' 1} p' 2 , p' 3 , 
. . ., in C' k , which converges to some point p'* G C' k C dC' k , say. This 
sequence of points can be lifted to a sequence of points in the section Ci 
by the real root function ry. 

Pi = (Pi,rj(p[)), p 2 = (P2,rj(jJ 2 )), Ps = <A,rj(j/ 3 )), ■■■■ 
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For every neighborhood N containing p'* , consider its image under the 
map rj. The intersection of all such neighborhoods must be a connected 
interval of J C p'* x R. Also, all the defining polynomials T must vanish 
over J. But as a direct consequence of the well-basedness assumption, we 
find that J must be a point contained in the image of a real root function 
over C' k C dC' k . The rest follows from a direct examination of the geometry 
of a cylindrical algebraic decomposition. D 

The resulting cell complex is usually represented by a labeled directed 
graph G = (V, E, 5, a), where 

V = vertices representing the cells 

E = edges representing the incidence relation among the cells 
uEv ^ C u C (X 

S : V — > N = dimension of the cells 

a : V — > { — 1,0, +1} = sign assignment to the cells 

Such a graph allows one to study the connectivity structures of the 
cylindrical decomposition, and has important applications to robotics path 
planning. G is said to be a connectivity graph of a cell complex. 

8.6.6 Tarski Geometry 

Tarski sentences are semantic clauses in a first-order language (defined 
by Tarski in 1930) of equalities, inequalities, and inequations of algebraic 
functions over the real. Such sentences may be constructed by introducing 
the following quantifiers, "V" (universal quantifier) and "3" (existential 
quantifier), to the propositional algebraic sentences. The quantifiers are 
assumed to range over the real numbers. 

Let Q stand for a quantifier (either universal V or existential 3). If 
<f>(yi, ■ ■ •, y r ) is a propositional algebraic sentence, then it is also a first- 
order algebraic sentence. All The variables y's are free in <j>. Let 
. . ., y r ) and ^(z\, . . ., z s ) be two first-order algebraic sentences (with free 
variables y's and z's, respectively); then a sentence combining $ and ^ by 
a Boolean connective is a first-order algebraic sentence with free variables 
{yi} U {zi}. Lastly, let $(j/i, . . ., y r , x) be a first-order algebraic sentence 
(with free variables x and y), then 




is a first-order algebraic sentence with only y's as the free variables. The 
variable x is bound in (Q x)[$\. 



Section 8.6 



Real Geometry 



355 



A Tarski sentence 3>(yi, . . ., y r ) with free variable y's is said to be true, 
if for all (Ci, . .., Cr) € M x 

$(&,•••.&) =True. 
Example 8.6.9 1. Let f(x) e Z[^] have the following real roots: 

oei < ■ ■ ■ < atj-i < atj < ••• . 
Then the algebraic number ctj can be expressed as 

f(y) = A (bxi,...,^] 

(aj 2 - x 1 > 0) A • • • A (arj_i - x,-_ 2 > 0) 
A(/(x 1 ) = 0)A---A(/(^_ 1 ) = 0) 
A(Vz) [(/(*) = Ay- *>0) 

((2 - a;i = 0) V • • • V (z - Xj-\ = 0))] 

If (l,r) is an isolating interval for ctj, we could also express the real 
root a.j by the following Tarski sentence: 

(f(y) = 0) A (y - I > 0) A (r - y > 0) 
A (Va:) ((a;-y^0)A(a;-Z>0)A(r-a;>0)) => f(x)^0 . 

2. Consider the following Tarski sentence: 

(3 x) (V y) [(y 2 - x > 0) . 

The sentence can be seen to be true, since if we choose a strictly 
negative number as a value for x, then for all y, the difference of y 2 
and x is always strictly positive. 

Next, consider the following Tarski sentence: 
(3 x) (V y) [(y 2 - x < 0) . 

The sentence can be seen to be false. 

3. Let S C R K be a semialgebraic set; then its closure S can be defined 
by the following Tarski sentence: 

= (V e) [(e > 0) (3 y) [$ s (y) A ||i - y|| 2 < e] 

where $5 is a defining formula for 5. S — {x : ^(x)}. 
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4. Let C\ and C2 be two cells of a cylindrical algebraic decomposition 
of R K , with defining formulas $Ci an d &c 2 > respectively. C\ and C2 
are adjacent if and only if 

Cin^^0 or cTnc 2 ^0. 

The following Tarski sentence characterizes the adjacency relation: 

(((e>0)A$ Cl (5)) (*o 2 (»)A(||i-y|| 2 <e))) 
V(((e>0)A<I>c 2 (5)) (*c 2 (y)A(||i-y|| 2 <e))) 

Since we shall produce an effective decision procedure for Tarski sen- 
tences, we see that one can construct the connectivity graph of a cell 
complex effectively, provided that we have the defining formulas for 
the cells of the cell complex. D 

A Tarski sentence is said to be prenex if it has the form 

(Qxi) (Qx 2 ^ ■ • ■ (q in) (j)(y 1 ,y 2 ,...,y r ,xi,...,x n ) , 

where <fi is quantifier-free. The string of quantifiers (Q x\) (Q x%) ■ • • (Q x n ) 
is called the prefix and cf) is called the matrix. Given a Tarski sentence \£, 
a prenex Tarski sentence logically equivalent to ^ is called its prenex form. 

The following procedure shows that for every Tarski sentence, one can 
find its prenex form: 

1. Step 1: Eliminate redundant quantifiers. Replace a subformula 
(Q x){&] by if x does not occur in $. 

2. Step 2: Rename variables such that the same variable does not occur 
as free and bound. If there are two subformulas *&(x) and (Q x)[<j>(x)] 
at the same level, replace the latter by (Q x new )[$(x new )], where x new 
is a new variable not occurring before. 

3. Step 3: Move negations (-1) inward. 

n(Vi)[$(i)] -> (3 x)[-^<f>{x)] 

n(3l)[$(l)] -» (Vs)h$(l)l 

-i($V*) -> (-i$A-i*) 
^($A*) -> (-.$V-.*) 
-1-1$ — > $ 
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4. Step 4: Push quantifiers to the left. 



(2i)[J(i)]A« 
(fii)[$(i)]V$ 
*A(Qi)[$(i)] 
*V(Qi)[$(i)] 



(Qx)[$(x)Afj 
(Qi)[*(s)V*j 
(fii)[*A$(i)] 
(Qi)[$V§(i)] 



Example 8.6.10 Consider the following Tarski sentence: 

V x) [((V y) [f(x) = 0] V (V z) [g(z, y) > 0]) => -(V y ) y) < 0] 
After eliminating redundant quantifiers and renaming variables, 

(V x) [([/(x) = 0] V (V z) [ 5 (z,y) > 0]) => -(V «;) [/i(x,«/) < 0]" 
After simplification, we have 

V x) [([/(x) ± 0] A -(V z) [g{z, y) > 0]) V -(V tu) «;) < 0] 
After moving negations inward, 

(Vx) [([/(x) ^ 0] A (3 z) [<,(*, y) < 0]) V (3 w) [h(x,w) > 0} . 
After pushing the quantifiers outward, 
(V x) (3 z) (3 w) [((/(x) £ 0) A (. 9 (z, y) < 0)) V (/i(x, w) > 0)] . □ 

Finally, we are ready to consider an effective procedure to decide whether 
a given Tarski sentence \I/(xi, . . ., x r ) is true. Here assumed 
to be its free variables, and the polynomials occurring in are assumed to 
have rational coefficients. 

As a result of our earlier discussion, we may assume that our Tarski 
sentence is presented in its prenex form and that it is universally closed 
with respect to its free variables. 



(Vxi) •••(Vx r ) f(xi,...,x r ) 
= (Vxi) ••• (Vx r ) (Qa; r +i) 



(Qx n ) [lp(xi, ■ ■ . ,X r ,X r+ i, . . . ,X„)], 



ip is a quantifier-free matrix. Thus from now on we deal only with prenex 
Tarski sentences with no free variable and where the variables are so ordered 
that in the prefix the n variables appear in the order 

*£l j %2 ; • • • j <En • 

One can describe the decision procedure for such a Tarski sentence in 
terms of a Player-Adversary game. We start with the following illustra- 
tions: 
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Example 8.6.11 1. The game for the following Tarski sentence pro- 
ceeds as shown below: 



(3 x) (V y) [{y 2 -x>0) 



The first quantifier is 3 and the first move is Player's. Player chooses 
a strictly negative number for x, say — 1. The second quantifier is 
a V and the next move is Adversary's. Now independent of what 
Adversary chooses for y, we see that 

y 2 - (-1) = y 2 + l > 0. 

The matrix is true, and hence Player wins and the Tarski sentence is 
true. 

2. Next consider the game for the following Tarski sentence: 



3 x ) ( V y 



(y 2 -x<0) 

Again the first quantifier is 3 and the first move is Player's. Let 
Player choose x = a. The second quantifier is a V and the next move 
is Adversary's. Adversary chooses y = a + (1/2), and we have 

y 2 -x = (a+l/2) 2 -a = a 2 + 1/4 > 0. 

The matrix is false, and hence Adversary wins and the Tarski sentence 
is false. 

Thus given a Tarski sentence 

(Qixi) ••• (Q n x n ) [V>(zi,...,a;„)], 

the i th (i = 1, . . ., n) move is as follows: Assume the values selected up to 
this point are 

Ci) C2, • ■ • , C*-i 

• If the i th quantifier Qj is 3, then it is Player's move; otherwise, if Qi 
is V, then it is Adversary's move. 

• If it is Player's move, he selects Xi — d in order to force a win for 
himself, i.e., he tries to make the following hold: 

(Qi+iXi+i) ■■■ (Q n x n ) [ip(Ci,...,Ci,x i+ i,...,x n )] =True. 

If it is Adversary's move, he selects x% — d in order to force a win 
for himself, i.e., he tries to make the following hold: 



(Q i+1 x t+1 ) ■■■ (Q n x n ) [V>(Ci> ■■■ Xi, x i+1, ■■■ .%)] = False - 
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After all Q's have been chosen, we evaluate 

V'(Ci.---.Cn); 

If it is true, then Player wins (i.e., the Tarski sentence is true); otherwise, 
Adversary wins (i.e., the Tarski sentence is false). Thus, the sequence of 
choices in this game results in a point 

p = (Ci,...,Cn> eR K , 

and the final outcome depends on this point p. 

Let T C Q[^if , . . ., x n ] be the set of polynomials appearing in the 
matrix ip. Now consider a cylindrical algebraic decomposition K. of for 
T . Let C p G /C be the cell containing p. If g e C p is a sample point in the 
cell C p 

9 = (ai,..., a„) G M K , 

then the aj's constitute a winning strategy for Player (respectively, Ad- 
versary) if and only if Q's also constitute a winning strategy for Player 
(respectively, Adversary). Thus, the search could have been conducted only 
over the coordinates of the sample points in the cylindrical algebraic decom- 
position. This leads to an effective procedure, once a cylindrical algebraic 
decomposition for J 7 , endowed with the sample points, have been computed. 

Since the cylindrical algebraic decomposition produces a sequence of 
decompositions: 

ACiofR^, AC^ofR*, /C K ofE K , 

such that the each cell Cj-ij of JCi is cylindrical over some cell Cj_i of 
/Cj_i, the search progresses by first finding cells C\ of JC\ such that 

(Q2X2) ■ ■ ■ (QnX n ) [ip(aci ,X2,---, x„)\ = True. 

For each C\ , the search continues over cells C12 of K.2 cylindrical over C\ 
such that 

(232:3) ••• (Q n x n ) [ip(a Cl ,ac 12 ,X3,...,x n )]=Twe, 

etc. Finally, at the bottom level the truth properties of the matrix ip are 
evaluated at all the sample points. 

This produces a tree structure, where each node at the (i — l) th level 
corresponds to a cell Cj_i G /Cj_i and its children correspond to the cells 
Ci-ij € JCi that are cylindrical over Cj_i. The leaves of the tree correspond 
to the cells of the final decomposition K, = K, n . Using the game-theoretic 
nature of the problem discussed earlier we can further label every node at 
the (i-l) th level "AND" (respectively, "OR") if Qi is a universal quantifier 
V (respectively, 3). Such a tree is a so-called AND-OR tree. 
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The tree may be evaluated as follows: First label the leaves true or 
false, depending on whether the matrix ip evaluates to true or false in 
the corresponding cell. Note that the truth value of ip depends only on 
the sign assignment of the cell. Inductively, assuming all the nodes up to 
level (i — 1) have been labeled, an i th level node is labeled true if it is an 
AND (respectively, OR) node and all (respectively, some) of its children 
arc labeled true. Finally, the Tarski sentence is true if and only if the root 
of the tree is labeled true. This constitutes a decision procedure for the 
Tarski sentences. 

Consider a sample point 



in some cell C £ IC. Assume that the algebraic number oti has an interval 
representation (fi,h,ri). The truth value at a leaf corresponding to C, 
(i.e., ip{ati, . . ., a n )) can be expressed by the following logically equivalent 
quantifier-free sentences involving only polynomials with rational coeffi- 
cients: 



Thus each leaf of the AND-OR tree can be expressed as a quantifier-free 
sentence as above. Now, the tree itself can be expressed as a quantifier- 
free sentence involving conjunctions and disjunctions for the AND and OR 
nodes, respectively. Clearly, all the polynomials involved are over Q. 
For examples, consider the sentences 



p = (ai,a 2 , . . . , a n ), 



(fl(zi) = f 2 (z 2 ) = --- = f n (z n ) = 0) 

A (h < Zi < n) A ■ ■ ■ A (l n < z n < r n ) 

A 1p(zi, Z 2 , ■ ■ ■ , Zn). 



(3 x) (V y) [(y 2 



x>0) 



and 




The sample points for a CAD of y 2 — 



x are as follows: 



(1,2) 
(1,1) 
(1,1/2) 

(1,0) 
(1,-1/2) 
(1,-1) 
(1,-2) 




(0,1) 
(0,0) 

(0,-1) 



> 



The equivalent quantifier-free sentences are 



(0>-l) 

V (1 > 0) A (0 > 0) A (1 > 0) 

V (4 > 1) A (1 > 1) A (1/4 > 1) A (0 > 1) A (1/4 > 1) A (1 > 1) A (4 > 1), 
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and 



(0<-l) 

V (1< 0) A (0 < 0) A (1 < 0) 

V (4 < 1) A (1< 1) A (1/4 < 1) A (0 < 1) A (1/4 < 1) A (1< 1) A (4 < 1). 



By a simple examination we see that the first sentence is true, while the 
second is false. 

In summary, we have the following: 

Theorem 8.6.10 Let $ be a Tarski sentence involving polynomials with 
rational coefficients. Then we have the following: 

• There is an effective decision procedure for 

• There is a quantifier-free propositional sentence <j> logically equiva- 
lent to <f>. The sentence (j) involves only polynomials with rational 
coefficients. □ 

Corollary 8.6.11 Tarski sets (subsets ofM. K defined by a Tarski sentence) 
are exactly the semialgebraic sets. □ 



Problems 

Problem 8.1 

In an ordered field K, we define \x\ for all x € K as follows: 



Prove that for all x, y € K, 

(i) \x + y\ < \x\ + \y\. 

(ii) \xy\ = \x\ \y\. 



Problem 8.2 

Give a proof for the following: 

Let K be a real closed field and f{x) e K[x\. If a, b £ K, a < b, then 




if x > 0; 
if x < 0. 



(3 c € (a, 6)) [/(&) - f(a) = (b- a)D(f )(c) . 



This is the so-called intermediate value theorem. 
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Problem 8.3 

Is it true that every integral polynomial f(x) £ Z[^] has all its real 
roots in the closed interval [— ||/|joo, ||/||oo]? 
Hint: Consider the polynomial x 2 — x — 1. 



Problem 8.4 

Show that an algebraic number has a unique minimal polynomial up to 
associativity. 



Problem 8.5 

Consider a real univariate polynomial 

f(x) = x n + a n —ix n l H h a m x m + a x x + a Q E 

and u a real root of /. 

(i) Lagrange- Maclaurin's Inequality. Let 

a n -i > 0, . . . , a m -i > 0, and a m < 0. 

Prove that 

u < 1 + I min(ai) I 
Hint: Assume that u > 1 and 

= /(w) = m™ + a„_iw™~ 1 H ha m u m + aiw, + a 

> u n -mm(a l )(u m + u m - 1 + ---+u + l) 

' u m+l _ ^ 



u n — min(ai) 



u- 1 



(ii) Cauchy's Inequality. Let 

a mi < 0, a m2 < 0, . . . , a mfe < 0, (mi > m 2 > • • • > m fe ), 
be the only negative coefficients of /. Prove that 

U <max((fc |a mj |) 1/( "- m ^). 
Hint: Assume that u > 1 and 



= /(«) = w" + a„_iw™ 1 H h a m u m + am + a Q 

> u n -a mi u mi a mk u mk 

> u n - kmax(\a mj \u mj ). 

3 
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Problem 8.6 

Let f(x) G be a univariate real polynomial of positive degree 

n > 0. 

(i) Consider /'s Fourier sequence: 

FOURIER(/) =( f<-°\x) = f(x), 

fW(x) = f'(x), 
f {2 \x), 

f (n) (x) ), 

where /W denotes the i th derivative of / with respect to x. Prove the 
following: 

Budan-Fourier Theorem. 

Let f(x) e R[r^] and a and b e M be two real numbers with 
a < b. Then 

# real roots of / (counted with multiplicity) in (a, b) 

!< Var a (FOUMER(/)) - Var b (FOURiER(/)), and 
= Var a (FOURlER(/)) - Var 6 (FOURlER(/)) (mod2). 

(ii) Using the Budan-Fourier theorem, present a proof for the following: 

Descartes' Theorem. 

Let f(x) e R[r\] be as follows: 

f{x) = a n x" + an-ix 11 ^ 1 + h ao, 

and 

V{f) = Var((o n , a n _i, . . . , a )). 

Then the number of strictly positive real roots of / (counted 
with multiplicity) does not exceed V(f) and is congruent to 
V(f) (mod2). ' 

Problem 8.7 

Let (ho, hi, . . ., h s ) be a Sturm sequence of / and /'. 
Let V be the number of sign variations in the sequence 

v = (Hcoef(/io), Hcoef(Zii), Hcocf(/i s )), 
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and W be the number of sign variations in the sequence 

w = (SHcoef(/io), SHcoef(/n), SHcoef(/i s )), 
where Hcoef(/i) is the leading coefficient of h and 

SHcoef(h) = (-l) des ^Hcoei(h) 

is the "sign- adjusted" leading coefficient of h. 

Show that the number of distinct real roots of / is W — V. 

Compute v and w for a quadratic polynomial x 2 + bx + c. What is 

— V as a function of b and c? Can you derive an algebraic criterion for 

the number of real zeros of x 2 + bx + c? 



Problem 8.8 

Let /, gi and g^ be a simple set of polynomials in K[x] (K = a real 
closed field), in the sense that all their roots are distinct, i.e., they are all 
square-free and pairwise relatively prime. 

(i) For any interval [a, i]CJ( (a < 6), show that 



1111 

I- 11-1 

II- 1-1 
1-1-1 1 



c f 



<?i > 0, 5 2 > 



3i > 0, 32 < 



3i < 0, 32 > 



3i < 0, 32 < 



Var 
Var 
Var 



STURM(/, f ) 



STURM (/, f gi ) 



sturm(/, f'g 2 ) 



Var 



sturm(/, fgxgi) 



(ii) Show how the preceding formulation can be generalized to the case 
when /, 3i and 32 are not simple. 
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(iii) Show how to obtain a further generalization of this formulation 
when there are more than two g's: /, gi, 52, • • ■, g n - 

(iv) Suppose you are given a system of polynomial inequalities, involv- 
ing polynomials gi, g2, ■ ■ ., g n in R[W], as follows: 

9i(x) § 0, 
92(x) § 0, 



9n{x) § 0, 

where the notation u gi(x) 0" represents one of the following three re- 
lations: gi(x) < 0, gi(x) = or gi(x) > 0. Using the formulation (iii) 
and linear algebra, devise a process to determine if there is a solution x at 
which all the inequalities are satisfied. 

Hint: Construct a sequence /, gi, g 2l ■ ■ ., g n as in (iii), where / is 
such that, for any sign assignment to gi's, the interval corresponding to 
this assignment, / has a zero: 

G{x) = gi{x) g 2 {x) ■ ■ ■ g n {x), 
G{x) 



Let 
where 



G(x) GCD{G{x),G'(x))' 
f{x) = G{x) G\x){x - N){x + N), 



N = 1 + max - 
a 

the diS are coefficients of G(x), and a,k = Hcoef(G(a:)). 



Problem 8.9 

Let f(x) and g(x) e Z[^] be two arbitrary integral polynomials of 
positive degrees m = deg(/) and n — deg(g), respectively. Show that 



/(a) = ° r I/H> TTWTW 



(V a, s.t. sr(a) = 0) 
Hint: Consider the zeros of the resultant, Resultant x (<7(x), f(x) — y) 



Problem 8.10 

Consider the following monic irreducible integral polynomial f(x), 
f (x) = x n - 2(ax - l) 2 , n>3,a>3,aeZ. 
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Show that 



Separation(/) < 2- 



(||/|| 1 /4)«+V4- 

Hint: Show that /(1/a) > and f(l/a±h) < 0, for h = a- {n+2 ^ 2 . 



(8.1) 
(8.2) 
(8.3) 



Problem 8.11 (Hadamard- Collins-Horowitz Inequality.) 
If S is a commutative ring, a seminoma for S is a function 

i/:S^{r>0:r£R} 

satisfying the following three conditions: For all a, b £ S, 

v(a) = <=> a = 0, 
v{a + b) < u(a) + v(b), 
v{a b) < v{a) v{b). 

(i) Show that ||a||i = \a\ is a scminorm for the integers Z. 

(ii) Show the following: 

1. If v is a seminorm over S, then its extension to S[x] defined below 

v{a n x n + a„_i.T™ _1 H ha ) = "K) + f(«n-i) H h v(a ) 

is also a seminorm over 5 [a;]. 

2. If is a seminorm over 5*, then its extension to an arbitrary matrix 

M e S' mx ™ 

i=l j=l 

satisfies the conditions (8.1), (8.2) and (8.3), whenever the operations 
arc defined. 

(iii) If M is a square matrix over a commutative ring S with a scminorm 
v, then show that the following generalization of Hadamard's inequality 
holds: 

i/(detM) < ]J^( M i), 

i 

where Ma is the i th row of M and det M is the determinant of M. 



Problem 8.12 

Consider an n x n polynomial matrix with integral polynomial entries 



M(x) 



Ai,i(x) A lj2 {x) ■■■ A hn (x) 
MA X ) MA*) ■■■ M,n{x) 

A nA (x) A ni2 {x) ■■■ A n . n (x) 
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Prove that 



where 



<lot(,U(,r))^() « dot .(jl/(-JL- ))=(). 



a = max HAij Iloo and = maxdeg(Ajj). 



Problem 8.13 

Consider two real algebraic numbers a and (3 defined by two polynomials 
f(x) and g(x) of respective positive degrees m and n. Prove that if a ^ (3 
then 

A = |a — /3| > 2(n+1)(m+1) n/iinyiy 



Problem 8.14 

Let /(x) G C[^] be an arbitrary polynomial: 

/(x) = c„x™ + c n _ix™ _1 H h Co. 

Consider the usual norm for complex numbers: 

\a + ib\ = {a 2 +b 2 ) l l 2 . 
The 1-norm of / can now defined to be 

ll/lli = l c «l + l c «-il H 1" l c o|- 

Let the zeros of / be enumerateed as 

161 < 161 < •••16,1 < 1 < IWil < ■■■ < 161, 

counting each zero as many times as its multiplicity. 
Let M{f) be defined as 



M(f) 



n JJmax(l,&) 



c. 

i=l 



|c„| J|max(l, |&|). 



First we want to derive the following relations: 

M(f)<\\f\\ 1 <2 n M(f). 

(i) Prove that 

n/iii = Ei c «^^---^i^ 2 "^(/)- 
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(ii) Next show that 

IogM(/) = -!- \og\f{e a )\dt< max log |/(e lt )| < log ||/ \\ u 

ZlT Jq l<t<27T 

Thus, concluding that 

M{f)<\\fh. 
Hint: Use Jensen's integral formula: 

— ^ Iog|F(pe <t )|dt = log|F(0)|+2log|||, F(0)?0, 

where F(x) is a function of the complex variable, regular on the circle of 
radius p. The zeros of F in |a:| < p are given as £i, . . ., £ m . 

(iii) Using these inequalities show that, if / is factorized as follows 

f(x) = f 1 (x)f 2 (x)---f s (x), 

then 

||/i||i||/ 2 ||i---||/ s ||i<2 de ^) Wfh. 



Problem 8.15 

Let f(x) be a polynomial with zeros in C. Then we denote its complex 
root separation by A(/): 

A(/) = min{|a - 0\ : a ^ e C A 0(a) = U(J3) = V}. 

Let f(x) be an integral polynomial: 

f(x) = a n x n + a„_ix rl_1 H h a . 

Prove the following: 

(i) If f(x) is square-free, then 

A(/)>V3n-("+ 2 >/ 2 \\f\\-^\ 

(ii) In general, 

A(/) > 2-"("- 1 ) n-("+ 2 )/ 2 ||/||r (n_1) . 



Problem 8.16 

(i) Devise a simple and efficient (0(n 3 lgn)-time)algorithm to convert 
a real algebraic number from its order representation to its interval repre- 
sentation. 

(ii) Prove the following corollary of Thorn's lemma: 
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Consider two real roots £ and £ of a real univariate polynomial 
f(x) e of positive degree n > 0. Then 

if and only if, for some < m < n, the following conditions 
hold: 

sgn(/ (m) (0) ± sgn(/ (m) (C)) 

sgn(/< m+1 >(£)) = sgn(/( m+1 )(C)), 



sgn(/ (n) (0) = sgn(/(")(C)). 

and 

1. either sgn(.f < m+1 )) = +1 and / (m) (£) > / M (C), 

2. or sgn(/(" l+1 )) = -1 and /( m >(£) < / (m) (0- 

(iii) Using the corollary above, devise an efficient (0(n 3 lg 2 n)-time) 
algorithm to convert a real algebraic number from its sign representation 
to its interval representation. 



Problem 8.17 

Prove that if S is a semialgebraic set then its interior, int(S*), closure 
S, and boundary d(S) = S \ int(S') are all semialgebraic. 



Problem 8.18 

Show that every semialgebraic set is locally connected. 



Problem 8.19 

Let SCM K be a semialgebraic set. Prove that for some m e N, there 
is a real algebraic set T C M K+> such that 



tt(T) = S, 



where 



tm+m 



is a natural projection map. 

Thus, show that semialgebraic sets constitute the smallest class of sub- 
sets of K K closed under projection and containing real algebraic sets. 
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Hint: Let S C R K be defined as follows: 

S={(J!i ^€« K : s g n(U a (^ J K ) ...,^ K )) = ~ 3 }, 

where i = 1, . . ., m. Let us now define T as follows: 
X 1 ^ (xi , . . . , x n , x n j r \ , . . . , x n + m ) £ IR 

E (^/i + i) a + E /'+ E (4 + ,,/,-i) 2 = o}. 

j';sj<0 j;sj=0 j;sj>0 

Verify that S = tt(T). 

Problem 8.20 

A robotic system 1Z is defined to be a finite collection of rigid compact 
subparts 

j-Bi, -B 2 , . . . , £? m j, 

where each subpart is assumed to be defined by a piecewise algebraic sur- 
face. 

A configuration is an n-tuple of parameters that describes the positions 
and the orientations of the subparts uniquely; the corresponding space of 
parameters 1 K is called a configuration space. 

Additionally we may assume that between every pair of subparts Bi 
and Bj at most one of the following holonomic kinematic constraints may 
exist: 

• Revolute Joint: There is a fixed axis L through a pair of points 
Pi G Bi and pj € Bj such that Bi and Bj are only allowed to rotate 
about L. 

• Prismatic Joint: There is a fixed axis L through a pair of points 
Pi G Bi and pj € Bj such that Bi and Bj are only allowed to translate 
about L. 

(i) Bi's are assumed to be able to take any configuration subject to the 
kinematic constraints such that no two subpart occupies the same space. 

A point of the configuration space corresponding to such a configuration 
is said to be free. Otherwise, it is called forbidden. The collection of points 
of configuration space that are free are said to constitute the free space, 
and its complement forbidden space. 

Show that the free and forbidden spaces are semialgebraic subsets of 
R K . 

(ii) Given an initial and a desired final configurations of the robotic 
system, 1Z, the motion planning problem is to decide whether there is a 
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continuous motion of the subparts from the initial to the final configuration 
that avoids collision and respects the kinematic constraints. 

Devise an algorithm to solve the motion planning problem. 

Hint: First compute the connectivity graph for the polynomials defin- 
ing the free space, and show that there is a continuous motion from one 
configuration in a free cell to another configuration in a free cell, if there is 
a path between the vertices corresponding to the respective free cells. 

Problem 8.21 

Consider the following set of TV — 2 2 complex numbers corresponding 
to the N th roots of unity: 

{tti+ift : i= 1,...,JV} C C. 

Define 

S n = {(aupi) : » = 1,...,JV} C K*. 

Show that there is a Tarski sentence ^f n (x,y) with two free variables (x 
and y) and 0{n) quantifiers, 0(n) variables, 0(n) real linear polynomials, 
and O(l) real quadratic polynomial such that 

{(a,0) : *„(a,/3) = TRU E } = S n . 

Problem 8.22 

(i.a) Let gi(x,y) and gj(x,y) £ M.[r\,rx]. Define 

D{gi) = isolated points of gt = 0; 
D(gi, gj) = isolated points of gt — and gj = 
\(D( 9i )UD( 9j )). 

If gi(x, y) and gj(x, y) are irreducible polynomials, show that 

\D( 9l )\ < (dcg(. 9j )) 2 and \D( 9i , gj )\ < 2 deg( 9l ) deg( 9j ). 

(i.b) Given a quantifier-free sentence in two variables x and y, involving 
polynomials 

F = {/i( 

x, y), f2(x, y), . . . , f m (x, 

y)} c 

show that every isolated point is either in some D{gi) or some D{gi,gj), 
where gt's and gj's are irreducible factors of /j's. 

(i.c) Show that the total number of isolated points of a quantifier-free 
sentence defined by T is bounded from above by 

(m 
»=i 
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Problem 8.23 

As direct consequences of the preceding two problems, show the follow- 
ing: 

(i) For every n e N, there exists a quantified Tarski sentence ^ n with 
n quantifiers, of length 0(n), and degree 0(1) such that any quantifier-free 
sentence ij) n logically equivalent to \I/„ must involve polynomials of 

degree = 2" and length = 2 

(ii) For every n E N, there exists a quantified Tarski sentence $f n with 
n quantifiers, of length 0(n), and degree 0(1) such that \I>„ induces a 
cylindrical decomposition, JC n , of R > (m = 0(n)) with 

|/C n | = 2 2S!< "' cells. 

(iii) Thus, argue that both quantifier elimination problem and cylindri- 
cal algebraic decomposition problem have double exponential lower bounds 
for their time complexity as a function of the input size. 
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Problem 8.2 

First, we choose a linear function f{x) such that 

f(x)=f(x)-f(x), 
vanishes at the points a and b. Hence, 

f(a) = f(a) and /(&) = /(&). 

Thus, 



D(f) 



f(b)-f(a) _ 

6 — a 



and 



b — a b — a b — a 

Now we can apply Rolle's theorem to f(x) as f(a) = f(b) = 0: 

(3 c e (a, 6)) [l>(/)(c) = D(/)(c) - D(f){c) = . 

But since, D(f) = (f(b) — f(a))/(b — a), we have the necessary conclusion. 
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We may rewrite the formula above in the following form: 
f(b) = f(a) + /'(c) (6 - a), for some c e (a, b). 



Problem 8.4 

Consider an algebraic number a. Let f(x) and g(x) G Z[^] be two 
nonzero minimal polynomials of a. Clearly f(x) and g(x) have the same 
degree. Thus, if we consider the following polynomial 

r(x) = Hcoe%) f(x) - Hcoef(/) e Z^], 

thendeg(r) < deg(/) = dcg(g) and r(a) = Hcoef(<?) f{a) — Hcoef(/) = 
0. Thus r(x) must be identically zero as, otherwise, it would contradict our 
assumption that f(x) and g(x) were two minimal polynomials of a. Thus 

Hcoe%) f(x) = Hcoef(/) g(x) and f(x) w 5(2;). 



Problem 8.6 

(i) Using Taylor's expansion theorem, we may express the values of the 
Fourier sequence in a neighborhood of a real number c as follows: 

2 m 
f(c + e) = / ( c ) + e /(i)( c ) + - / (2)( c) + ... + _ / W( c ) 

n 

2 TO-1 

+ = / (1) (c) + e/( 2 )(c) + -/( 3 )(c) + ... + ^ TT ^/M(c) 

_n— 1 

/(™)( C + e) - /(H (c) + e/ (-+i) (c ) + ... + ___/(") (c) , 

/<">(c + e) = /W(c). 

Let us now consider an interval [a, 6] containing exactly one real root c 
of / with multiplicity m. Note that it suffices to consider the sign variations 
in some interval (c — e, c + e) C [a, 6]. Clearly: 



/(°)( c ) = /( 1 )( c ) = ... = /(™- 1 )(c) = and / (m W0. 
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Let s = sgn(/( m ) (c)). Hence 

sgn(/(c-e)) = (-l) m s, s = sgn(/(c + e)), 
agn(fW(c-e)) = (-l)" 1 " 1 *, a = sgn(f^(c + e)), 



sgn(/( m )(c- e)) s sgn(/( m )(c + e)), 

sgn(/( TO+1 )(c- e)) - sgn(/( m+1 )(c)) = sgn(/ (m+1) (c + e)), 

sgn(/(")(c-e)) - sgn(/(")(c)) = sgn(/(")(c + e)). 

Thus it is easily seen that the sequence at the left has exactly m more sign 
variations than the sequence at the right. Hence, in this case, 

# real roots of / (counted with multiplicity) in (a, b) 
= Var a (fourier(/) ) - Var 6 (FOURiER(/)), 

Next let us consider an interval [a, b] containing exactly one real root c 
of /( fe ) (k > 0) with multiplicity I. Let m = k + I. As before, it suffices to 
consider the sign variations in some interval (c — e, c + e) C [a, b}. Clearly: 

f(k) ( c ) = f(k+l) ( c ) = . . . = f(m-l) ( c ) = o and f(m) (c) ^ Q 

Let s = sgn(/( m )(c)). Hence 

sgn(/(c-e)) = sgn(/(c)) = sgn(/(c + e)), 

sgn (/(fc-i)( c _ e )) = sgI1 (/(*-i)( c )) = S gn(fW(c + e)), 
sgn(/( fe )(c-e)) = (-l)'s, a = sgn(/« (c + e)), 
sgn(/ (fc+i) (c _ e)) = (_iy-i s , s = sgnC/^+D^ + e)), 



sg n(/(™)(c-e)) = s = sgn(/M( C + e)), 

sgn (/( m + 1 )(c- e)) = sgn(/( m+1 )(c)) = sgn(/( m+1 ) (c + e)), 

sgn(/(")(c-e)) = sgn(/(")(c)) = sgn(/(")(c + e)). 

We only need to consider the sign variations form c — e to c + e for the 
following subsequence: 

(/( fe-1 ) 5 f( k \ , /^ m \ /( m+1 '). 
There are two cases to consider: 
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1. sgn(/< fc - 1 )(c)) = sgn(/(" l+1 )(c)), in which case the subsequence has 
even number of sign variations both at c — e and c + e, and 

2. sgn(/< fc - 1 )(c)) ^ sgn(/(" i+1 )(c)), in which case the subsequence has 
odd number of sign variations both at c — e and c + e; 

and in either case the subsequence at the left has more sign variations than 
the subsequence at the right. Thus, 







< Var Q (FOURiER(.f)) - Var b (FOURiER(/)) and 
= Var a (FOURiER(.f)) - Var;,(FOURiER(/)) (mod2). 

(ii) Using Budan-Fourier theorem, we have 

^ strictly positive real roots of / (counted with multiplicity) 

< Var (FOURiER(/)) - Varoo(FOURiER(/)) and 

= Var (FOURiER(/)) - Varoo(FOURiER(.f)) (mod2). 
Thus Descartes' theorem follows, once we show that 

Var (FOURiER(/)) = V(f) and Var 00 (FauTuER(/)) = 0. 
This is obvious as 

sgn (FOURiER(/)) = sgn((a ,ai,2a 2 , . . . ,m!a m , . . . ,n!a„)), 

and 

sgn oc (FOURlER(/)) = sgn((a„,a„,a„, . . . ,a n , . . . ,a„)). 

Problem 8.10 

First note that 

H/lli = 2a 2 + 4a + 3 < (2 + 4/3 + l/3)a 2 , for all a > 3. 

Hence a > \/||/||i/2 and it suffices to show that 

Separation(/) < 2h = 2 a - (n+2)/2 . 

Next observe that since 

f(x) = x n - 2a 2 x 2 + Aax - 2, and 
f(-x) = {-l) n x n -2a 2 x 2 -4ax-2, 

by Descartes' rule, / has at most two positive real roots and at most one 
negative real root. 
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Clearly, 

f(l/a) = (1/a)" > 0. 

Now, notice that 

/(1/a ± ft) = (l/a±ft)"-2(a(l/a±ft)-l) 2 = (1/a ± h) n - 2a 2 h 2 . 
Thus, choosing h = a~(" +2 )/ 2 , we have 

(1/a ± ft)" < (a" 1 +a -(«+2)/2)« = (l + a -"/ 2 )" a -" < (|^) a ~"> 



and 



/(l/o ± ft) < (lHja-«-2a 2 a-(«+ 2 > < 0. 



Thus, / has two positive real roots in the open interval (1/a — ft, 1/a + ft) 
and 

Separation(/) < min(l/a - ft, 2ft) = 2ft = 2 a - (ll+2)/2 . 



Problem 8.11 

(i) and (ii) arc tedious but straightforward. 

(iii) The proof is by induction on the order of the matrix n. The case 
n = 1 is trivial. Consider an n x n matrix M, n > 1. Let Mij denote the 
(i,j) th entry of M and let M- j denote the (i, j) th minor of M (i.e., the 
submatrix of M obtained by deleting the i th row and j th column) . Since 

n 

Kdet(M)) < 'Y^v{M hj )u{det{M[ j )) 

(by Laplace expansion formula for determinants) 

n / n \ 

< ^ I v{Mij) J| v(Mi) J 

J = l V i=2 / 

(since every row of M[ • is a subrow of some M i7 i > 2) 

n / n 

j=2 \j = l 



< 



Y[u(Mi). 



Problem 8.13 

Let 6 = a — (3. Then (<5, (3) is a zero of f(x, y) = f(x + y) and [3 is a 
zero of g(y). Thus 8 is a nonzero root of the polynomial 

R(y) = Resultant y (/(x, y), g(y)), 
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and 

a=|5|> ttk:- 

Now, 

\\R\\oo<\\Rh<\\f(x,y)\\n 9 (y)\\?. 
Observe that the 1-norm of f(x, y) is taken over 7L\r\, rx]. Now writing 

f(x) = a m x m + a m _ x x m - x + ■ ■ ■ + oo 

we have 

f(x,y) = f(x + y) = a m {x + y) m + a m - 1 (x + y) m - 1 + --- + a 



Thus 



Thus 
and 



= 5>E 

i=0 j=o v 



a; 1 



m I m 



m 



llMyJHx = H^JHx + ■■■ + ||4>(s)||i 



< En/iuE 

j=0 i=j 

m i 

< ii/iuEE 

j=0 j=0 



< H/HiEa* < Il/lli2 m+1 . 



i=0 



\\r\\oo < 2"(™+ i )||/n? n 5 |ir, 
A > 1 + Woe > 2 ~ (n+1)(ro+1) ii^ n ii^ ro 



Problem 8.15 

(i) Let the roots of / be enumerated as £i, £2, • • •, Now consider 
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the determinant of the following Vandcrmonde matrix: 



V n 



1 1 

e n-2 f n-2 

Si S2 

SI S2 



1 



<rn-2 
Sn 

tra— 1 
Sn 



Since det(V n V^) = Discriminant(/) ^ (/ is square-free) and since / 
is integral, we have 

|detK| > 1. 

Now if we subtract the j th column of V n from its i th column, then this 
elementary matrix operation does not change det V n , and we have 



detV n 



1 







1 








Cn 




g •• 




£d 

Sn 


n-2 
1 


c n-2 
S2 


tn-2 ^n-2 
Si Sj 


tn- 
Sn 


n-1 
1 


^n— 1 
S2 


^n — 1 ^n— 1 
Si Sj 


£71- 
Sn 



Using the Hadamard inequality, we have 
1 < I det K I 



< 



(Eis , -^i a ) 1/a n(Ei&i M ) 



1/2 



d=l 



Now 



< d|&-&||£ 



id— 1 

5>l| > 



assuming |&| > |^-| 
Hence, 



n— 1 / 3\ 

£ltf-tf| 2 < (y)|6-6| a max(l,|6|) a 



Similarly, 



]T|6| 2d < nmax(l,|&|) 2 



d=0 
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Thus, combining these inequalities, we get 



(3 \ \/2 
M) IJmaxa.l&l)"- 1 
' k=i 

and 

!«• ^1 > „(n+2)/2 

Thus since M{f) < ||/||i, we have the result 

V3 



A(/) > 



,(n+2)/2 II flin-l" 



(ii) Assume that /(x) is not square-free. Then its square-free factor 
g{x) is given by 

Then writing m = deg(g), 

A(/) = A( 9 ) 



m (m+2)/2 II^H™- 1 



V3 



> 



m (m+2)/2 ( 2 n||/|| 1 )"»-l 

V3 

2 n(n-l) n (n+2)/2 H/^- 1 ' 



Problem 8.18 

Let SCM K be a semialgebraic set. Let p e S be an arbitrary point of 
S and consider a small open neighborhood of p defined as follows: 

N s .e(p) = Sn{q:\\p-q\\ 2 <e}, 

for some e > 0. Clearly, JVs ie is a semialgebraic set defined by a set of 
polynomials JF e . Now consider a cylindrical algebraic decomposition JC e of 
defined by T f _. Note that /C e has finitely many cells and that for every 
e there is an open connected cell C e C Ns :e (p) such that 

peC e c5. 

Thus S is locally connected. 
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Problem 8.21 

The proof is in two steps: First, we shall construct a sequence of complex 
quantified sentences $fc's each equivalent to a single polynomial as follows: 

r>2 fc 

$fe(z ,Zfc) : z k = z . 

In the second step, we shall obtain a sequence of real quantified sentences 
^fe's each being an equivalent real version of $£. 

* fe (^ ),3(z ),3?(z fe ),3(z fc )) : 

A (s(^(z k )+i%(z k )f k ^=S(z )^ , 

where $t(z) and ^s(z) stand, respectively, for the real and imaginary parts 
of z. Now renaming x = 3t(zk) and y = ^s(zk) and letting zq = 1 (i.e., 
3?(z ) = 1 and $s(zo ) = 0, we get the final Tarski sentence. 

Base Case: k = 0, N = 2 2 ° = 2: 

$0(20,21) : z? = zo- 

Thus * is 

*o(Rfo),9fo),»(*i),9(*i)) : 

(R(*i) 2 - 3(z!) 2 = »(zo)) A (2H(*i) = 3(z )), 

or, after renaming 

*o(x,y) = * (l,0,a;,y) = (x 2 - y 2 - 1 = 0) A {xy = 0). 



Induction Case: fc > 0, TV = 2 2 : 
We would like to define 

{$k-i(w,z k )) A ($ fe _i(^ ,w)) • 
Thus is equivalent to saying 




However, in this case the formula size becomes "exponentially large" ! We 
avoid the exponential growth by making sure that only one copy of $fe_i 



§k{z ,z k ) = \3 wj 
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appears in the definition of <£>& as follows: 
*k{zo,z k ) = (3w) (Vz'.zj 

\(z> = z k ) A (z" = w)} V [(z' = w) A (z" = z )} 
(Sk-tiz^z")) ■ 

Thus 

* fe (3?(z ),3(z ),3?(z fe ),3(z fc )) 
= (3u,v) (yx',y',x",y") 

[[(x' R(* fc )) V (y' * 3(z fc )) V (x" ^u)V (y" ? v)] 
A [(x> ^u)y (y> ^v)y (x" X(zo)) V (y" ? 9(Z ))]] 

Finally, after renaming, we have 

*fc(a;,y) = * fc (l,0,a;,y). 
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A.l Matrices 

Let S be a commutative ring. We write M mxn (S) € gmxn to denote 
the class of matrices with m rows and n columns and with entries in S. 
Consider A € M mxn (S): 



A = {aij) = 



ai,i 
a-2,1 



ai, 2 
a 2 ,2 



0-2,n 



We write A,, to denote the i th column of A and A^., to denote he i th 
row. We also write A^j to denote the submatrix of A obtained from A by 
deleting the i th row and the j th column. 

The transpose of a matrix A is the matrix A T obtained by exchanging 
the rows and columns of A: 



A T = (Oj.i). 



The class ofnxn square matrices with entries in 5* is denoted by 
M n {S). The set of such matrices form a ring, with matrix addition and 
matrix multiplication, defined as follows. 

Assume A and B e M n (S). Then 

C = A + B => a t j = a^j +bi t j, i = l,...,n, j = l,...,n, 

and 

n 

C = A-B => Cjj = y^a»,fc 6fe,j, i = l, ...,n, j = l,...,n. 
fe=i 
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Appendix A: Matrix Algebra 



The additive identity is the zero matrix 0„ and the multiplicative iden- 
tity is the "identity" matrix I n : 








In 



1 
1 











A. 2 Determinant 

Definition A. 2.1 (Determinant) The determinant of an n x n square 
matrix A = (ojj) is defined by the following Laplace expansion formula: 



det(A) 



0-1,1, if n = 1; 

Er=i(- 1 ) i+1 «i,idet(^,i), ifn>l. □ 



We also write \ A\ to mean det(A). Let us define A\^ the (i, j) th coj "actor 
of A as 

where, as before, Aij is the submatrix of A obtained from A by deleting 
the i th row and the j th column. 

The n x n matrix adj (A) — {A' i j) T <E M n (S), whose (i, j) th entry is 
the (j, i) th cofactor of A, is called the adjoint of A. 

The Laplace expansion formula can be generalized as follows: 

Expansion with respect to the i th row: 

n 

det(A) = Y, a i,i A 'i,j> 
i=i 

where A'- ■ are the cofactors of A. 

Expansion with respect to the j th column: 

n 

det(A) = J2 a ^i A ir 

i=l 

where A\ - are again the cofactors of A. 
Thus, 

A ■ adj (A) = adj (A) ■ A = det{A) ■ I. 
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Let 7r e S n be a permutation of [1, n] C Z. Define the sign of a 
permutation ir as follows: 

sgn(i) = +1, i = identity permutation 
sgn(r) = — 1, r = transposition permutation 
sgn(7Ti7r 2 ) = sgn(TTi) sgn(7r 2 ), 

where m and TT2 are arbitrary permutations. Then 

n 

dct(A) = sgnW ]J Sw- 

ires,, i=l 

The following properties of determinant can be easily demonstrated: 
1. 

det(A T ) = det(A), where A <= M„(5). 

2. 

det(A B) = det(A) dct(B), where A,Be M n (5). 

3. If A,i = A.j (i ^ j), then 

det(A) = 0. 

4. Let A, B, and C € M n {S) be three n x n square matrices whose 
columns are all identical except the i th column: 

A. j = B.j = C.j, for all j ^ i, 

A.,i = £fl., i + CC.,i, 
where £, C e S. Then 

det(A) = £ det(B) +C det(C). 

5. If A is obtained by replacing its i th column by a linear combination 
of all the column vectors as follows: 

A.,i = fi A,i + • • • + 6 A,i + ■■■+£„ A,„, 

then 

det(2) = ^ dct(A). 
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A. 3 Linear Equations 

Next, we consider a system of n linear equations in n variables over a 
commutative ring S: 



ai,! xi + ai,2 X2 H \-ai >n x n = 

a 2 ,i X\ + 0,2,2 X2-\ 1- Cl2,n x n = 

a„,i x\ + a n _2 X2 H h o n>n x„ = 0. 



(A.4) 



The system of equation (A.4) is said to have a nontrivial solution, if it is 
satisfied by some assignment x\ = £i, . . ., x n = £„, £j € S and not a/Z £j 
zero. 

The matrix associated with the above system of equations is denoted 
by A: 



Ol,2 
0-2,1 «2,2 



02,n 



(A.5) 



Theorem A. 3.1 If the system of linear equations (A.4) has a nontrivial 
solution, then the determinant of its associated matrix A, det(A) is a zero 
divisor in S. 

Specifically, if S = an integral domain and equation (A.4) has a non- 
trivial solution, then det(A) = 0. 

PROOF. 

Suppose that £2, ■ • 61) 1S a nontrivial solution of (A.4). Assume 
without loss of generality that £1 ^ 0. Let A be obtained by replacing the 
first column of A by 



a a 2 



= 0. 



Then, 



det(A) = & det(A) =0. □ 



Lemma A. 3. 2 Lei S be a commutative ring. Consider the following two 
systems of linear equations: 
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(A.6) 



and 



(A.7) 



01,1 x\ + ai,2 x 2 H hoi,„a; n = 

aj,i xi + a ii2 ^2 H h <Xi,n x„ = 

aj,i xi + flj )2 x 2 + h aj,„ x„ = 

a n ,i x\ + a n _ 2 x 2 H h a„,„ x n = 0, 

ai,i a;i + ai,2 £2 + hoi,„ i„ = 

3TT x\ + Oj~i x 2 H h s^; x„ = 

Oj-,1 xi + aj,2 2:2 + 1- aj,„ £„ = 

a n ,i x\ + a n _ 2 x 2 H h a n>n x„ = 0, 

w/iere either 
1. 

^ zero divisor e S, or , 

If the system of equations (A.7) has a nontrivial solution, then so does the 
system of equations (A.6), and vice versa. 

PROOF. 

Let (£i, £2, ■ • €n) be a nontrivial solution of the system (A.7). 

Then, in the first case (where a^k = H a i,k), (m C17 M £2, ■ ■ •, M £n) is a 
solution of (A.6), and since \i is not a zero divisor, this is also a nontrivial 
solution. 

Similarly, in the second case (where a^k — at.k + %•,&), (£1, £2, • ■ ■ , 
is clearly a nontrivial solution of (A.6). □ 
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Theorem A. 3. 3 Consider a system of equations over an integral domain 
S as in (A. 4) with the associated matrix A as shown in (A. 5). 

Then the system of equations (A. 4) has a nontrivial solution if and only 
z/det(A)=0. 

PROOF. 

(=>■) The forward direction is simply Theorem A. 3.1. 

(<=) The converse can be shown by induction on the size n of the matrix 

A. If n = 1, the proof is trivial. Hence, assume that n > 1. 

Starting with the original system of equations modify all but the first 
equation such that the last (n — 1) equations only involve the last (n — 1) 
variables: x 2l ■ ■ ., x n . 

Without loss of generality, assume that ai.i 7^ 0. 

The i th equation is modified by subtracting an a»,i multiple of the first 
equation from an ai t i multiple of the i th equation. This has the effect of 
eliminating the first variable from all but the first equation. The resulting 
system may be written as 



01,1 x\ + 



ai,2 x 2 + 
a~2~a x 2 + 



+ ai,n x n = 
+ a2~7i x n = 



a n ,2 x 2 + 



~\~ On.n X n — 0, 



and by the preceding lemma, has a nontrivial solution if and only if (A. 4) 
does. 

The associated matrix is given by: 



ai,i ai i2 
a 2 ,2 



Q-l.n 
»2.n 



a 



n,2 



0-1,2 ■ ■ ■ ai.n 









Since det(^4) = det(A) = a\ t \ det(Ai i i) = 0, and since ai.i 7^ 0, 
det(Ai i i) = 0. Thus by the inductive hypothesis, the following system: 



02,2 X 2 



®2,n X Tl 







5^2 x 2 + • • • + a^n x n = 0. 
has a nontrivial solution, say (£ 2 , • • •, Then, 

-(^1,2^2 H \-ai,n£n), GH,l£n), 



is a nontrivial solution of the original system of equation (A. 4). □ 
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